POLYPEPTIDES THAT BIND HIV gp120 AND RELATED NUCLEIC
ACIDS, ANTIBODIES, COMPOSITIONS, AND METHODS OF USE 



TECHNICAL FIELD OF THE INVENTION 
The present invention relates to polypeptides with
homology to regions of domains of the human chemokine
receptors CCR5, CXCR4, and STRL33, as well as domains of
CD4 that bind with human immunodeficiency virus (HIV), in
particular HIV-1 glycoprotein 120 (gp120) envelope
protein. The present invention also relates to nucleic
acids encoding such polypeptides, antibodies,
compositions comprising such polypeptides, nucleic acids
or antibodies, and methods of using the same.

BACKGROUND OF THE INVENTION

There are seven transmembrane chemokine receptors
that act as cofactors for HIV infection. The cofactors
enable entry of HIV-1 into CD4+ T cells and macrophages
(Premack et al., Nature Medicine 2: 1174-78 (1996); and
Zhang et al., Nature 383: 768 (1996)).

The presence of chemokines has an inhibitory effect
on HIV-1 attachment to, and infection of, susceptible
cells. Additionally, some mutations in chemokine
receptors have been shown to result in resistance to
HIV-1 infection. For example, a 32-nucleotide deletion
within the CCR5 gene has been described in subjects who
remained uninfected despite repeated exposures to HIV-1
(Huang et al., Nature Medicine 2: 1240-43 (1996)).

Evidence also exists for the physical association of
chemokine receptors, CD4, and HIV-1 gp120 envelope
glycoprotein in a ternary complex on cell membranes
(Lapham et al., Science 274: 602-05 (1996)). Receptor
signaling and cell activation are probably not required
for the anti-HIV-1 effect of chemokines since a RANTES
analog lacking the first eight amino-terminal amino
acids, RANTES (9-68), lacked chemotactic and
leukocyte-activating properties, but bound to multiple
chemokine receptors and inhibited infection by
macrophage-tropic HIV-1 (Arenzana-Seisdedos et al.,
Nature 383: 400 (1996)). Cumulatively, the above-described
results suggest that the interaction between gp120, CD4,
and at least one chemokine receptor is obligatory for HIV-1
infection. Accordingly, reagents that interfere with the
binding of gp120 to chemokine receptors and to CD4 are
used in the biological and medical arts. However, there
presently exists a need for additional reagents that can
compete with one or more proteins of the
gp120-CD4-chemokine receptor complex to assist in basic
biological or viral research, and to assist in medical
intervention in the HIV-1 pandemic. It is an object of
the present invention to provide such reagents. This and
other objects and advantages, including additional
inventive features, will be apparent from the description
provided herein.

BRIEF SUMMARY OF THE INVENTION

The present invention provides a polypeptide that 
binds with HIV gpl20 under physiological conditions. 
Multiple embodiments of the present inventive polypeptide
are provided, and each embodiment possesses a degree of
homology to at least one of the human CCR5, CXCR4 and
STRL33 chemokine receptors, and the human CD4
cell-surface protein.

In a first embodiment, the present invention 
provides a polypeptide comprising the amino acid sequence 
YDIXYYXXE, wherein X is any synthetic or naturally
occurring amino acid residue, and the polypeptide
comprises less than about 100 contiguous amino acids that
are identical to, or, in the alternative, substantially
identical to, the amino acid sequence of the human CCR5
chemokine receptor. A preferred polypeptide of this
first embodiment comprises the amino acid sequence
YDIN*YYT*S*E. A more preferred polypeptide of this first
embodiment comprises the amino acid sequence YDINYYTSE,
wherein each letter is the standard one-letter
abbreviation for an amino acid residue (for
example, N denotes asparaginyl, T denotes threoninyl,
and S denotes serinyl). The polypeptide of the first
embodiment can comprise the amino acid sequence
M*D*YQ*V*S*SP*IYDIN*YYT*S*E. Preferably, the polypeptide
comprises the amino acid sequence MDYQVSSPIYDINYYTSE.

In a second embodiment, the present invention 
provides a polypeptide comprising the amino acid sequence 
XEXIXIYXXXNYXXX, wherein X is any synthetic or naturally
occurring amino acid and wherein said polypeptide
comprises less than about 100 contiguous amino acids that
are identical to or substantially identical to the amino
acid sequence of the human CXCR4 chemokine receptor. The
polypeptide can consist essentially of, or consist of,
the sequence EXIXIYXXXNY. Preferably, the polypeptide
comprises the sequence M*EG*IS*IYT*S*D*NYT*E*E*.
More preferably, M*EG*IS*IYT*S*D*NYT*E*E* is
M*EGISIYTSDNYT*E*E*.

In a third embodiment, the present invention
provides a polypeptide comprising the amino acid sequence
EHQAFLQFS, wherein said polypeptide comprises less than
about 100 contiguous amino acids that are identical to or
substantially identical to the amino acid sequence of the
human STRL33 chemokine receptor. The polypeptide can 
consist essentially of, or consist of, the sequence 
EHQAFLQFS. 

In a fourth embodiment, the present invention 
provides a polypeptide comprising at least a portion of 
an amino acid sequence selected from the group consisting 
of LPPLYSLVFIFGFVGNML, QWDFGNTMCQLLTGLYFIGFFS,
SQYQFWKNFQTLKIVILG, APYNIVLLLNTFQEFFGLNNCS, and
YAFVGEKFRNYLLVFFQK, wherein said polypeptide comprises
less than about 100 contiguous amino acids that are 
identical to or substantially identical to the amino acid 
sequence of the human CCR5 chemokine receptor. 

In a fifth embodiment, the present invention 
provides a polypeptide comprising at least a portion of 
an amino acid sequence selected from the group consisting 
of LLLTIPDFIFANVSEADD, WFQFQHIMVGLILPGIV, and 
IDSFILLEIIKQGCEFEN, wherein said polypeptide comprises
less than about 100 contiguous amino acids that are
identical to or substantially identical to the amino acid
sequence of the human CXCR4 chemokine receptor.

In a sixth embodiment, the present invention 
provides a polypeptide comprising at least a portion of 
an amino acid sequence selected from the group consisting 
of LVISIFYHKLQSLTDVFL, PFWAYAGIHEWVFGQVMC,
EAISTWLATQMTLGFFL, LTMIVCYSVIIKTLLHAG,
MAVFLLTQMPFNLMKFIRSTHW, HWEYYAMTSFHYTIMVTE,
ACLNPVLYAFVSLKFRKN and SKTFSASHNVEATSMFQL, wherein said
polypeptide comprises less than about 100 contiguous 
amino acids that are identical to or substantially 
identical to the amino acid sequence of the human STRL33 
chemokine receptor. 

In a seventh embodiment, the present invention 
provides a polypeptide comprising at least a portion of 
an amino acid sequence selected from the group consisting 
of DTYICEVED, EEVQLLVFGLTANSD, THLLQGQSLTLTLES, and
GEQVEFSFPLAFTVE, wherein said polypeptide comprises less
than about 100 contiguous amino acids that are identical
to or substantially identical to the amino acid sequence
of the human CD4 cell-surface protein. 

In the fourth to seventh embodiments, any selected 
portion of the polypeptide can comprise from 1 to about 6 
conservative amino acid substitutions. In an 
alternative, the polypeptide can be partially defined by 
an absence of a polypeptide sequence, outside the region
of the portion selected from the foregoing sequences,
that has five, or ten, contiguous amino acid residues
that have a sequence that consists of an amino acid
sequence that is identical to or substantially identical
to the protein to which the polypeptide has homology
(i.e., CCR5, CXCR4, STRL33, or CD4). In yet another
alternative, the polypeptide can lack a sequence of five
or ten contiguous amino acids which are identical to or
substantially identical to the sequence of the protein
with which the sequence has homology except that one or
more conservatively or neutrally substituted amino acids
replace part of the sequence of the protein to which the
polypeptide has homology. Additionally, any embodiment 
of the present inventive polypeptide can also comprise a 
pharmaceutically acceptable substituent. 

Any embodiment of the present inventive polypeptide 
can be incorporated into a composition, which further 
comprises a carrier. Any suitable embodiment of the 
present inventive polypeptide can be encoded by a nucleic 
acid that can be expressed in a cell. In this regard, 
the present invention further provides a vector 
comprising such a nucleic acid. The nucleic acids and 
vectors also can be incorporated into a composition 
comprising a carrier. 

Additionally, the present invention provides a 
method of making an antibody to a polypeptide of the 
present invention. The present invention also provides a 
method of prophylactically or therapeutically treating an 
HIV infection in a mammal. 

Additionally, the present invention provides an 
anti-idiotypic antibody comprising an internal image of a
portion of gp120, as well as a method of selecting such
an antibody. 

The present invention also provides a method of 
making an antibody to a portion of the gp120 protein that
binds with a portion of CCR5, CXCR4, STRL33, or CD4, as
well as the immunizing compound used to make the
antibody, and the antibody itself. In another embodiment
of the present invention, a method of removing HIV-1 from
a bodily fluid is provided. 



BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 depicts a listing of synthetic amino acids 
available (from Bachem, King of Prussia, PA) for 
incorporation into polypeptides of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention provides a polypeptide that 
binds with gp120 of HIV, in particular HIV-1, more
particularly HIV- l^a/ under physiological conditions. 
The polypeptide has a number of uses including, but not 
limited to, the use of the polypeptide to elucidate the 
mechanism by which HIV, such as HIV-1, attaches to and/or 
infects a particular cell, to induce an immune response 
in a mammal, in particular a human, to HIV, in particular 
HIV-1, and to inhibit the replication of HIV, in 
particular HIV-1, in an infected mammal, in particular a 
human. 

Multiple embodiments of the present inventive 
polypeptide are provided. Each embodiment of the 
polypeptide has a degree of homology to at least one of 
the human CCR5, CXCR4 and STRL33 chemokine receptors, or 
the human CD4 cell-surface protein. In each embodiment 
provided herein, a letter indicates the standard amino 
acid designated by that letter, and a letter followed 
directly by an asterisk (*) preferably represents the
amino acid represented by the letter (e.g., N represents
asparaginyl and T represents threoninyl), or a synthetic
or naturally occurring conservative or neutral
substitution therefor. Additionally, in accordance with
convention, all amino acid sequences provided herein are
given either from left to right, or top to bottom, such
that the first amino acid is amino-terminal and the last
is carboxyl-terminal. The synthesis of polypeptides,
either synthetically (i.e., chemically) or biologically,
is within the skill in the art.
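
The asterisk notation described above can be read mechanically. The following is a minimal sketch, not part of the original disclosure: the function names `parse_starred` and `preferred_sequence` are hypothetical, and the sketch assumes sequences are written as upper-case one-letter codes with an optional trailing asterisk on each residue.

```python
def parse_starred(sequence):
    """Split a starred sequence into (residue, substitutable) pairs.

    A letter followed directly by '*' marks a position where the named
    residue is preferred but a conservative or neutral substitution is
    allowed. Amino-terminal residue comes first, as in the text.
    """
    positions = []
    for ch in sequence:
        if ch == "*":
            # An asterisk must directly follow a residue letter.
            if not positions or positions[-1][1]:
                raise ValueError("asterisk must directly follow a residue letter")
            positions[-1] = (positions[-1][0], True)
        elif ch.isalpha() and ch.isupper():
            positions.append((ch, False))
        else:
            raise ValueError(f"unexpected character: {ch!r}")
    return positions


def preferred_sequence(sequence):
    """Return the sequence with every starred position set to its preferred residue."""
    return "".join(residue for residue, _ in parse_starred(sequence))


if __name__ == "__main__":
    for residue, substitutable in parse_starred("YDIN*YYT*S*E"):
        print(residue, "substitutable" if substitutable else "fixed")
    print(preferred_sequence("YDIN*YYT*S*E"))
```

Under these assumptions, the starred form YDIN*YYT*S*E reduces to YDINYYTSE when every starred position takes its preferred residue.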

It is within the skill of the ordinary artisan to
select synthetic and naturally occurring amino acids that 
make conservative or neutral substitutions for any 
particular naturally occurring amino acids. The skilled 
artisan desirably will consider the context in which any 
particular amino acid substitution is made, in addition 
to considering the hydrophobicity or polarity of the 
side-chain, the general size of the side chain, and the 
pK value of side-chains with acidic or basic character 
under physiological conditions. For example, lysine, 
arginine, and histidine are often suitably substituted 
for each other, and more often arginine and lysine. As 
is known in the art, this is because all three amino 
acids have basic side chains, whereas the pK values for
the side-chains of lysine and arginine are much closer to
each other (about 10 and 12) than to histidine (about 6). 
Similarly, glycine, alanine, valine, leucine, and 
isoleucine are often suitably substituted for each other, 
with the proviso that glycine is frequently not suitably 
substituted for the other members of the group. This is
because each of these amino acids is relatively
hydrophobic when incorporated into a polypeptide, but
glycine's lack of a β-carbon allows the phi and psi
angles of rotation (around the α-carbon) so much
conformational freedom that glycinyl residues can trigger
changes in conformation or secondary structure that do
not often occur when the other amino acids are
substituted for each other. Other groups of amino acids
frequently suitably substituted for each other include, 
but are not limited to, the group consisting of glutamic 
and aspartic acids; the group consisting of 
phenylalanine, tyrosine and tryptophan; and the group 
consisting of serine, threonine and, optionally, 
tyrosine. Additionally, the skilled artisan can readily 
group synthetic amino acids with naturally occurring 
amino acids. 
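
The substitution groups named above can be written down as a simple lookup. This is a minimal sketch under stated assumptions: the group labels, the function names, and the `allow_glycine` flag are hypothetical conveniences, only the naturally occurring residues listed in the text are covered, and context-dependent judgment by the skilled artisan is not modeled.

```python
# Groups of residues that the text describes as often suitably
# substituted for each other (standard one-letter codes).
SUBSTITUTION_GROUPS = {
    "basic": set("KRH"),        # lysine, arginine, histidine
    "aliphatic": set("GAVLI"),  # glycine, alanine, valine, leucine, isoleucine
    "acidic": set("ED"),        # glutamic and aspartic acids
    "aromatic": set("FYW"),     # phenylalanine, tyrosine, tryptophan
    "hydroxyl": set("STY"),     # serine, threonine and, optionally, tyrosine
}


def shared_groups(a, b):
    """Return the sorted names of the groups that contain both residues."""
    return sorted(
        name for name, members in SUBSTITUTION_GROUPS.items()
        if a in members and b in members
    )


def is_candidate_substitution(a, b, allow_glycine=False):
    """Return True if swapping a for b stays within one of the groups.

    Glycine swaps are rejected by default, reflecting the proviso that
    glycine is frequently not a suitable substitute within its group.
    """
    if a == b:
        return False
    if not allow_glycine and "G" in (a, b):
        return False
    return bool(shared_groups(a, b))


if __name__ == "__main__":
    print(is_candidate_substitution("K", "R"))
    print(is_candidate_substitution("G", "A"))
    print(shared_groups("Y", "S"))
```

Tyrosine appears in two groups here because the text lists it both with the aromatic residues and, optionally, with serine and threonine.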

In the context of the present invention, a 
polypeptide is "substantially identical" to another 
polypeptide if it comprises at least about 80% identical 
amino acids. Desirably, at least about 50% of the 
non-identical amino acids are conservative or neutral 
substitutions. Also, desirably, the polypeptides differ 
in length (i.e., due to deletion mutations) by no more 
than about 10%. 
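
The "substantially identical" criteria can be illustrated with a short calculation. This is a sketch only, with several assumptions not stated in the text: the two sequences are supplied already aligned (equal-length strings, with "-" marking a gap), percent identity is taken over aligned columns, the length difference is taken relative to the longer sequence, and "about" is treated as an exact threshold. The function names are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of aligned columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences have a gap.
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == "-" and y == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)


def length_difference_percent(aligned_a, aligned_b):
    """Difference in ungapped length, as a percentage of the longer sequence."""
    len_a = len(aligned_a.replace("-", ""))
    len_b = len(aligned_b.replace("-", ""))
    longest = max(len_a, len_b)
    if longest == 0:
        return 0.0
    return 100.0 * abs(len_a - len_b) / longest


def is_substantially_identical(aligned_a, aligned_b,
                               min_identity=80.0, max_length_diff=10.0):
    """Apply the 80% identity and 10% length-difference criteria."""
    return (percent_identity(aligned_a, aligned_b) >= min_identity
            and length_difference_percent(aligned_a, aligned_b) <= max_length_diff)


if __name__ == "__main__":
    # One substitution in nine residues: about 88.9% identical.
    print(is_substantially_identical("YDINYYTSE", "YDIQYYTSE"))
    # One deletion in nine residues: identity passes, length differs by about 11.1%.
    print(is_substantially_identical("YDINYYTSE", "YDIN-YTSE"))
```

The further preference that at least about 50% of the non-identical residues be conservative or neutral substitutions is not checked in this sketch.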

In a first embodiment, the present invention 
provides a polypeptide comprising the amino acid sequence 
YDIXYYXXE, wherein X is any synthetic or naturally
occurring amino acid residue, and the polypeptide 
comprises less than about 100 contiguous amino acids, 
preferably less than about 50 amino acids, more 
preferably less than about 25 amino acids, and yet more 
preferably less than about 13 amino acids that are 
identical to, or, in the alternative, substantially 
identical to, the amino acid sequence of the human CCR5 
chemokine receptor. 
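
The two conditions of the first embodiment, presence of the motif YDIXYYXXE and a bound on the length of contiguous sequence shared with the receptor, can be sketched as follows. This is an illustration under assumptions: sequences are one-letter strings, so X can only stand for residues that have a single-character code (synthetic residues would need a different representation); only exact identity is measured, not substantial identity; and the reference used is the 18-residue amino-terminal fragment MDYQVSSPIYDINYYTSE given in this document rather than the full-length receptor. The function names are hypothetical.

```python
import re


def motif_to_regex(motif):
    """Compile a motif in which X matches any single residue character."""
    return re.compile("".join("." if ch == "X" else re.escape(ch) for ch in motif))


def contains_motif(sequence, motif):
    """Return True if the motif occurs anywhere in the sequence."""
    return motif_to_regex(motif).search(sequence) is not None


def longest_shared_run(sequence, reference):
    """Length of the longest contiguous stretch identical in both sequences.

    Standard longest-common-substring dynamic programming, keeping only
    the previous row of the table.
    """
    best = 0
    previous = [0] * (len(reference) + 1)
    for a in sequence:
        current = [0] * (len(reference) + 1)
        for j, b in enumerate(reference, start=1):
            if a == b:
                current[j] = previous[j - 1] + 1
                best = max(best, current[j])
        previous = current
    return best


if __name__ == "__main__":
    reference_fragment = "MDYQVSSPIYDINYYTSE"  # fragment recited in this document
    candidate = "GGYDINYYTSEGG"               # hypothetical candidate polypeptide
    print(contains_motif(candidate, "YDIXYYXXE"))
    print(longest_shared_run(candidate, reference_fragment))
```

A candidate would satisfy the recited limit when the longest shared run, measured against the full receptor sequence, is below the chosen bound (about 100, 50, 25, or 13 residues).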

Preferably, the polypeptide of the first embodiment 
comprises YDIXYYXXE, wherein the amino moiety of the 
amino-terminal tyrosinyl residue is not bound to another
amino acid residue via a peptidic bond, and the carboxyl
moiety of the glutamyl residue is not bound to another
amino acid residue via a peptidic bond. However, the
polypeptide can consist essentially of YDIXYYXXE and,
optionally, can be modified by one or more
pharmaceutically acceptable substituents, such as, for
example, t-boc or a saccharide.

More particularly, the polypeptide comprises the
amino acid sequence YDIN*YYT*S*E. Preferably, N* is
asparaginyl, T* is threoninyl, and S* is serinyl.

The polypeptide of the first embodiment can comprise
a dodecapeptide selected from the amino acid sequence
M*D*YQ*V*S*SP*IYDIN*YYT*S*E. More preferably, the
polypeptide of the first embodiment comprises the amino
acid sequence MDYQVSSPIYDINYYTSE.

In a second embodiment, the present invention
provides a polypeptide comprising the amino acid sequence
XEXIXIYXXXNYXXX, wherein X is any synthetic or naturally
occurring amino acid, and the polypeptide comprises less
than about 100 contiguous amino acids, preferably less
than about 50 amino acids, and more preferably less than
about 25 amino acids, that are identical to or
substantially identical to the amino acid sequence of the
human CXCR4 chemokine receptor. Optionally, the
polypeptide consists essentially of, or consists of, the
sequence EXIXIYXXXNY.

In a preferred polypeptide of this second
embodiment, the polypeptide comprises the amino acid
sequence M*EG*IS*IYT*S*D*NYT*E*E*. Preferably,
M*EG*IS*IYT*S*D*NYT*E*E* is M*EGISIYTSDNYT*E*E*.

In a third embodiment, the present invention
provides a polypeptide comprising the amino acid sequence
EHQAFLQFS, wherein the polypeptide comprises less than
about 100 contiguous amino acid residues, preferably less
than about 50 contiguous amino acid residues, more
preferably less than about 25 contiguous amino acid
residues, that are identical to or substantially
identical to the amino acid sequence of the human STRL33
chemokine receptor. The polypeptide can consist
essentially of, or consist of, the sequence EHQAFLQFS.

The first three embodiments of the present invention 
provide, among other things, polypeptides having 
substantial identity or identity to the amino-terminal 
regions of the chemokine receptors CCR5, CXCR4, and 
STRL33. These first three embodiments form a first group 
of embodiments of the present invention. The present 
invention also provides, in a second group of 
embodiments, polypeptides having substantial identity or 
identity to an internal region of the human chemokine 
receptors CCR5, CXCR4, and STRL33, as well as to the
leukocyte cell-surface protein CD4. 

This second group of embodiments provides a
polypeptide that binds with HIV gp120 under physiological
conditions and comprises at least a portion of or all of
an amino acid sequence selected from the group consisting
of LPPLYSLVFIFGFVGNML, QWDFGNTMCQLLTGLYFIGFFS,
SQYQFWKNFQTLKIVILG, APYNIVLLLNTFQEFFGLNNCS, and
YAFVGEKFRNYLLVFFQK, wherein the polypeptide comprises
less than about 100 amino acids that are identical to or
substantially identical to the amino acid sequence of the
human CCR5 chemokine receptor; or selected from the group
consisting of LLLTIPDFIFANVSEADD (165-182),
WFQFQHIMVGLILPGIV (197-214), and IDSFILLEIIKQGCEFEN
(261-278), wherein the polypeptide comprises less than
about 100 amino acids that are identical to or
substantially identical to the amino acid sequence of the
human CXCR4 chemokine receptor; or

selected from the group consisting of
LVISIFYHKLQSLTDVFL (53-70), PFWAYAGIHEWVFGQVMC (85-102),
EAISTWLATQMTLGFFL (185-202), LTMIVCYSVIIKTLLHAG
(205-222), MAVFLLTQMPFNLMKFIRSTHW (237-258),
HWEYYAMTSFHYTIMVTE (257-274), ACLNPVLYAFVSLKFRKN
(281-298) and SKTFSASHNVEATSMFQL (325-342), wherein the
polypeptide comprises less than about 100 amino acids
that are identical to or substantially identical to the
amino acid sequence of the human STRL33 chemokine
receptor; or

selected from the group consisting of DTYICEVED,
EEVQLLVFGLTANSD, THLLQGQSLTLTLES, and GEQVEFSFPLAFTVE,
wherein the polypeptide binds with HIV gp120 under
physiological conditions and comprises less than about
100 amino acids that are identical to or substantially
identical to the amino acid sequence of the human CD4
cell-surface protein. Optionally, the recited amino acid
sequences can comprise 1 to about 6 conservative or
neutral amino acid substitutions.

The polypeptides of this second group of embodiments
preferably comprise less than about 50 amino acid
residues, and more preferably less than about 25 amino
acid residues, and yet more preferably no additional
amino acid residues, that are identical to a protein that
naturally has the recited amino acid sequence. The
polypeptide can be alternatively characterized by an
absence of a region, outside the above-recited amino acid
sequences, that has about five, or about ten, contiguous
amino acid residues whose sequence consists of residues
that are identical to, or conservatively substituted
relative to, an amino acid sequence of the protein to
which the polypeptide has homology.

Any embodiment of the present inventive polypeptide 
can also comprise a pharmaceutically acceptable 
substituent, attachment of which is within the skill in 
the art. The pharmaceutical acceptability of
substituents is understood by those skilled in the art.
For example, a pharmaceutically acceptable substituent 
can be a biopolymer, such as a polypeptide, an RNA, a 
DNA, or a polysaccharide. Suitable polypeptides comprise 
fusion proteins, an antibody or fragment thereof, a cell 
adhesion molecule or a fragment thereof, or a peptide 
hormone. Suitable polysaccharides comprise polyglucose 
moieties, such as starch and their derivatives, such as 
heparin. The pharmaceutically acceptable substituent 
also can be any suitable lipid or lipid-containing
moiety, such as a lipid of a liposome or a vesicle, or 
even a lipophilic moiety, such as a prostaglandin, a 
steroid hormone, or a derivative thereof. Additionally, 
the pharmaceutically acceptable substituent can be a 
nucleotide or nucleoside, such as nicotinamide adenine
dinucleotide or thymine, an amino acid residue, a 
saccharide or disaccharide, or the residue of another 
biomolecule naturally occurring in a cell, such as
inositol or a vitamin, such as vitamin C, thiamine, or
nicotinic acid. Synthetic organic moieties also can be
pharmaceutically acceptable substituents, such as t-butyl
carbonyl, an acetyl moiety, quinine, polystyrene and
other biologically acceptable polymers. Optionally, a
pharmaceutically acceptable substituent can be selected
from the. group consisting of a Ca.-Ci8 alkyl, a Ca-Cjg 
alkenyl, a C^-C^s alkynyl, a Cg-Cig aryl, ^ Ci-C^^s alkaryl, a 
aralkyl, and a C^-C^^ cycloalkyl, wherein any of the 
foregoing moieties that are cyclic comprise from 0 to 2 
atoms per carbocyclic ring, which can be the same or 
different, and are selected from the group consisting of 
nitrogen, oxygen, and sulfur. 

Any of the substituents from this group can be
substituted by one to six substituent moieties, which can 
be the same or different, selected from the group 
consisting of an amino moiety, a carbamate moiety, a 
carbonate moiety, hydroxyl, a phosphamate moiety, a 
phosphate moiety, a phosphonate moiety, a pyrophosphate 
moiety, a triphosphate moiety, a sulfamate moiety, a
sulfate moiety, a sulfonate moiety, a C^-Cg monoalkylamine 
moiety, a Ci-Cg dialkyl amine moiety, and a Cj^-C^ 
trialkylamine moiety. 

Any embodiment of the present inventive polypeptide 
can be encoded by a nucleic acid and can be expressed in 
a cell. The skilled artisan will recognize that the 
encoded polypeptide as well as any pharmaceutically
acceptable substituent to be incorporated into the
polypeptide, e.g., a formyl or acetyl substituent on an
amino-terminal methionine or a saccharide, will
preferably be produced by a cell that can express the
polypeptide of the present invention. Accordingly, the
amino acids incorporated into the polypeptide encoded by
the nucleic acid are preferably naturally occurring.

A nucleic acid as described above can be cloned into 
any suitable vector and can be used to transduce,
transform, or transfect any suitable host. The selection
of vectors and methods to construct them are commonly
known to persons of ordinary skill in the art and are
described in general technical references (see, in
general, "Recombinant DNA Part D," Methods in Enzymology,
Vol. 153, Wu and Grossman, eds., Academic Press (1987)).
Desirably, the vector comprises regulatory sequences, 
such as transcription and translation initiation and 
termination codons, which are specific to the type of 
host (e.g., bacterium, fungus, plant, or animal) into 
which the vector is to be inserted, as appropriate and 
taking into consideration whether the vector is DNA or 
RNA. Preferably, the vector comprises regulatory 
sequences that are specific to the genus of the host. 
Most preferably, the vector comprises regulatory 
sequences that are specific to the species of the host 
and is optimized for the expression of an above-described 
polypeptide.

Constructs of vectors, which are circular or linear, 
can be prepared to contain an entire nucleic acid 
sequence as described above or a portion thereof ligated 
to a replication system that is functional in a 
prokaryotic or eukaryotic host cell. Replication systems
can be derived from ColE1, 2 mμ plasmid, λ, SV40, bovine
papilloma virus, and the like.

Suitable vectors include those designed for 
propagation and expansion, or for expression, or both. A
preferred cloning vector is selected from the group
consisting of the pUC series, the pBluescript series
(Stratagene, La Jolla, CA), the pET series (Novagen,
Madison, WI), the pGEX series (Pharmacia Biotech,
Uppsala, Sweden), and the pEX series (Clontech, Palo
Alto, CA). Examples of animal expression vectors include
pEUK-C1, pMAM and pMAMneo (Clontech, Palo Alto, CA).

An expression vector can comprise a native or 
nonnative promoter operably linked to a nucleic acid 
molecule encoding an above-described polypeptide. The
selection of promoters, e.g., strong, weak, inducible,
tissue-specific and developmental-specific, is within the
skill in the art. Similarly, the combining of a nucleic
acid molecule as described above with a promoter is also
within the skill in the art. 

The skilled artisan will also recognize that the 
polypeptide has the ability to bind the gp120 protein,
which is most often found outside of cells. Accordingly,
the present inventive nucleic acid advantageously can
comprise a nucleic acid sequence that encodes a signal
sequence such that a signal sequence is translated as a
fusion protein with the present inventive polypeptide
to form a signal sequence-polypeptide fusion. The signal
sequence can cause secretion of the entire polypeptide,
including the signal sequence (which is a
pharmaceutically acceptable substituent), or can be
cleaved from the polypeptide (i.e., the polypeptide of
the compound) prior to, or during, secretion so that at
least the present inventive
polypeptide is secreted out of a cell in which the
nucleic acid is expressed.

Alternatively, the nucleic acid comprises or encodes
an antisense nucleic acid molecule or a ribozyme that is
specific for a specified amino acid sequence of an
above-described polypeptide. A nucleic acid sequence
introduced in antisense suppression generally is
substantially identical to at least a portion of the
endogenous gene or genes to be repressed, but need not be
identical. Thus, the vectors can be designed such that
the inhibitory effect applies to other proteins within a
family of genes exhibiting homology or substantial
homology to the target gene. The introduced sequence
also need not be full-length relative to either the
primary transcription product or the fully processed
mRNA. Generally, higher homology can be used to
compensate for the use of a shorter sequence.
Furthermore, the introduced sequence need not have the
same intron or exon pattern, and homology of non-coding
segments will be equally effective.
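
As background to the antisense approach, an antisense strand is the reverse complement of the corresponding sense (coding) strand. The following is a minimal sketch of that relationship only; the function names and the 12-nucleotide example sequence are hypothetical and are not taken from this document, and the sketch does not address partial homology, sequence length, or vector design.

```python
# Translation table for complementing the four DNA bases.
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA sequence."""
    invalid = set(dna) - set("ACGT")
    if invalid:
        raise ValueError(f"unexpected bases: {sorted(invalid)}")
    return dna.translate(_DNA_COMPLEMENT)[::-1]


def antisense_rna(sense_dna):
    """Return the antisense RNA (5' to 3') for a sense-strand DNA sequence."""
    return reverse_complement(sense_dna).replace("T", "U")


if __name__ == "__main__":
    sense = "ATGGATTATCAA"  # hypothetical sense-strand fragment
    print(reverse_complement(sense))
    print(antisense_rna(sense))
```

Taking the reverse complement twice returns the original sequence, which gives a quick sanity check on any antisense construct derived this way.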

Ribozymes also have been reported to have use as a 
means to inhibit expression of endogenous genes. It is 
possible to design ribozymes that specifically pair with 
virtually any target RNA and cleave the phosphodiester 
backbone at a specific location, thereby functionally 
inactivating the target RNA. In carrying out this 
cleavage, the ribozyme is not itself altered and is, 
thus, capable of recycling and cleaving other molecules, 
making it a true enzyme. The inclusion of ribozyme 
sequences within antisense RNAs confers RNA-cleaving 
activity upon them, thereby increasing the activity of 
the constructs. The design and use of target
RNA-specific ribozymes is described in Haseloff et al.,
Nature 334: 585-591 (1988).

Further provided by the present invention is a
composition comprising an above-described polypeptide or
nucleic acid and a carrier therefor. Another composition
provided by the present invention is a composition
comprising an antibody to an above-described polypeptide
or an anti-antibody to an above-described polypeptide.

Any embodiment of the present invention including
the present inventive polypeptide, nucleic acid,
antibody, and anti-antibody, can be incorporated into a
composition comprising a carrier. The carrier can serve
any function. For example, the carrier can increase the
solubility of the present inventive polypeptide, nucleic
acid or antibody in aqueous solutions. Additionally, the
carrier can protect the present inventive polypeptide,
nucleic acid or antibody from environmental insults, such
as dehydration, oxidation, and photolysis. Moreover, the
carrier can serve as an adjuvant, or as a timed-release
control means in a biological system.

Antibodies can be generated in accordance with
methods known in the art. See, for example, Benjamin, In
Immunology: a short course, Wiley-Liss, NY, 1996, pp.
436-437; Kuby, In Immunology, 3rd. ed., Freeman, NY,
1997, pp. 455-456; Greenspan et al., FASEB J. 7: 437-443
(1993); and Poskitt, Vaccine 9: 792-796 (1991).
Anti-antibodies (i.e., anti-idiotypic antibodies) also can
be generated in accordance with methods known in the art
(see, for example, Benjamin, In Immunology: a short
course, Wiley-Liss, NY, 1996, pp. 436-437; Kuby, In
Immunology, 3rd. ed., Freeman, NY, 1997, pp. 455-456;
Greenspan et al., FASEB J. 7: 437-443 (1993); Poskitt,
Vaccine 9: 792-796 (1991); and Madiyalakan et al.,
Hybridoma 14: 199-203 (1995) ("Anti-idiotype induction
therapy")). Such antibodies can be obtained and employed
either in solution-phase or coupled to a desired
solid-phase matrix. Having in hand such antibodies, one
skilled in the art will further appreciate that such
antibodies, using well-established procedures (e.g., such
as described by Harlow and Lane (1988, supra)), are useful
in the detection, quantification, or purification of
gp120 or HIV, particularly HIV-1, conjugates of each and
host cells transformed to produce a gp120 receptor or a
derivative thereof. Such antibodies are also useful in a
method of prevention or treatment of a viral infection
and in a method of inducing an immune response to HIV as
provided herein.

In view of the above, an above-described polypeptide
can be administered to an animal. The animal generates
anti-polypeptide antibodies. Among the anti-polypeptide
antibodies generated or induced in the animal are
antibodies that have an internal image of gp120. In
accordance with well-known methods, polyclonal or
monoclonal antibodies can be obtained, isolated and
selected. Selection of an anti-polypeptide antibody that
has an internal image of gp120 can be based upon
competition between the anti-polypeptide antibody and
gp120 for binding to an above-described polypeptide, or
upon the ability of the anti-polypeptide antibody to bind
to a free polypeptide as opposed to a polypeptide bound
to gp120. Such an anti-antibody can be administered to
an animal to prevent or treat an HIV infection in
accordance with methods provided herein.

Although nonhuman anti-idiotypic antibodies, such as
an anti-polypeptide antibody that has an internal image
of gp120 and, therefore, is anti-idiotypic to gp120, are
useful for prophylaxis in humans, their favorable
properties might, in certain instances, be further
enhanced and/or their adverse properties further
diminished, through "humanization" strategies, such as
those recently reviewed by Vaughan, Nature Biotech. 16:
535-539 (1998).

Prior to administration to an animal, such as a
mammal, in particular a human, an above-described
polypeptide, nucleic acid, antibody or anti-antibody can
be formulated into various compositions by combination
with appropriate carriers, in particular,
pharmaceutically acceptable carriers or diluents, and can
be formulated to be appropriate for either human or
veterinary applications.

The present invention also provides a method of
making an antibody. The method comprises administering
an immunogenic amount of an above-described polypeptide
or nucleic acid to an animal, such as a mammal, in
particular a human. Determining the quantity of a
polypeptide or nucleic acid that is immunogenic will
depend in part on the degree of similarity to a protein
or other molecule of the inoculated animal, the route of
administration of the polypeptide or nucleic acid, and
the size of the polypeptide administered or encoded by
the administered nucleic acid. If necessary, the
polypeptide or nucleic acid can be mixed with or ligated
to a substance (or an adjuvant) that enhances its
immunogenicity. Such calculations and procedures are
within the skill of the ordinary artisan. Additionally,
the present inventive method preferably can be used to
induce an immune response against HIV, particularly
HIV-1, in a mammal, particularly a human.

In view of the above, the present invention further
provides a method of prophylactically or therapeutically
treating an HIV infection in a mammal, particularly a
human, in need thereof. The method comprises
administering to the mammal an HIV replication-inhibiting
effective amount of an above-described polypeptide,
nucleic acid, or an anti-antibody to an above-described
polypeptide or a nucleic acid encoding such a
polypeptide.

The present invention also provides a method of
prophylactically or therapeutically treating HIV
infection in a mammal. The method comprises
administering to the mammal an effective amount of an
above-described polypeptide or nucleic acid. Prior to
administration to an animal, such as a mammal, in
particular a human, an above-described polypeptide or
nucleic acid can be formulated into various compositions
by combination with appropriate carriers, in particular,
pharmaceutically acceptable carriers or diluents, and can
be formulated to be appropriate for either human or
veterinary applications.

Thus, a composition for use in the method of the
present invention can comprise one or more of the
polypeptides, nucleic acids, antibodies or
anti-antibodies described herein, preferably in combination
with a pharmaceutically acceptable carrier.
Pharmaceutically acceptable carriers are well-known to
those skilled in the art, as are suitable methods of
administration. The choice of carrier will be
determined, in part, by whether a polypeptide or a
nucleic acid is to be administered, as well as by the
particular method used to administer the composition.
Optionally, the carrier can be selected to increase the
solubility of the composition or mixture, e.g., a
liposome or polysaccharide. One skilled in the art will
also appreciate that various routes of administering a
composition are available, and, although more than one
route can be used for administration, a particular route
can provide a more immediate and more effective reaction
than another route. Accordingly, there is a wide
variety of suitable formulations of compositions that can
be used in the present inventive methods.

A composition in accordance with the present
invention, alone or in further combination with one or
more other active agents, can be made into a formulation
suitable for parenteral administration, preferably
intraperitoneal administration. Such a formulation can
include aqueous and nonaqueous, isotonic sterile
injection solutions, which can contain antioxidants,
buffers, bacteriostats, and solutes that render the
formulation isotonic with the blood of the intended
recipient, and aqueous and nonaqueous sterile suspensions
that can include suspending agents, solubilizers,
thickening agents, stabilizers, and preservatives. The
formulations can be presented in unit dose or multi-dose
sealed containers, such as ampules and vials, and can be
stored in a freeze-dried (lyophilized) condition
requiring only the addition of the sterile liquid
carrier, for example, water, for injections, immediately
prior to use. Extemporaneously injectable solutions and
suspensions can be prepared from sterile powders,
granules, and tablets, as described herein.

A formulation suitable for oral administration can
consist of liquid solutions, such as an effective amount
of the compound dissolved in diluents, such as water,
saline, or fruit juice; capsules, sachets or tablets,
each containing a predetermined amount of the active
ingredient, as solid or granules; solutions or
suspensions in an aqueous liquid; and oil-in-water
emulsions or water-in-oil emulsions. Tablet forms can
include one or more of lactose, mannitol, corn starch,
potato starch, microcrystalline cellulose, acacia,
gelatin, colloidal silicon dioxide, croscarmellose
sodium, talc, magnesium stearate, stearic acid, and other
excipients, colorants, diluents, buffering agents,
moistening agents, preservatives, flavoring agents, and
pharmacologically compatible carriers.

Similarly, a formulation suitable for oral
administration can include lozenge forms, which can
comprise the active ingredient in a flavor, usually
sucrose and acacia or tragacanth; pastilles comprising
the active ingredient in an inert base, such as gelatin
and glycerin, or sucrose and acacia; and mouthwashes
comprising the active ingredient in a suitable liquid
carrier; as well as creams, emulsions, gels, and the like
containing, in addition to the active ingredient, such
carriers as are known in the art.

An aerosol formulation suitable for administration
via inhalation also can be made. The aerosol formulation
can be placed into a pressurized acceptable propellant,
such as dichlorodifluoromethane, propane, nitrogen, and
the like.

A formulation suitable for topical application can 
be in the form of creams, ointments, or lotions. 

A formulation for rectal administration can be
presented as a suppository with a suitable base
comprising, for example, cocoa butter or a salicylate. A
formulation suitable for vaginal administration can be
presented as a pessary, tampon, cream, gel, paste, foam,
or spray formula containing, in addition to the active
ingredient, such carriers as are known in the art to be
appropriate.

Important general considerations for design of
delivery systems and compositions, and for routes of
administration, for polypeptide drugs also apply
(Eppstein, CRC Crit. Rev. Therapeutic Drug Carrier
Systems 5, 99-139, 1988; Siddiqui et al., CRC Crit. Rev.
Therapeutic Drug Carrier Systems 3, 195-208, 1987; Banga
et al., Int. J. Pharmaceutics 48, 15-50, 1988; Sanders,
Eur. J. Drug Metab. Pharmacokinetics 15, 95-102, 1990;
Verhoef, Eur. J. Drug Metab. Pharmacokinetics 15, 83-93,
1990). The appropriate delivery system for a given
polypeptide will depend upon its particular nature, the
particular clinical application, and the site of drug
action. As with any protein drug, oral delivery will
likely present special problems, due primarily to
instability in the gastrointestinal tract and poor
absorption and bioavailability of intact, bioactive drug
therefrom. Therefore, especially in the case of oral
delivery, but also possibly in conjunction with other
routes of delivery, it will be necessary to use an
absorption-enhancing agent in combination with a given
polypeptide. A wide variety of absorption-enhancing
agents have been investigated and/or applied in
combination with protein drugs for oral delivery and for
delivery by other routes (Verhoef, 1990, supra; van
Hoogdalem, Pharmac. Ther. 44, 407-43, 1989; Davis,
J. Pharm. Pharmacol. 44(Suppl. 1), 186-90, 1992). Most
commonly, typical enhancers fall into the general
categories of (a) chelators, such as EDTA, salicylates,
and N-acyl derivatives of collagen, (b) surfactants, such
as lauryl sulfate and polyoxyethylene-9-lauryl ether, (c)
bile salts, such as glycocholate and taurocholate, and
derivatives, such as taurodihydrofusidate, (d) fatty
acids, such as oleic acid and capric acid, and their
derivatives, such as acylcarnitines, monoglycerides, and
diglycerides, (e) non-surfactants, such as unsaturated
cyclic ureas, (f) saponins, (g) cyclodextrins, and (h)
phospholipids.

Other approaches to enhancing oral delivery of
protein drugs can include the aforementioned chemical
modifications to enhance stability to gastrointestinal
enzymes and/or increased lipophilicity. Alternatively,
the protein drug can be administered in combination with
other drugs or substances that directly inhibit proteases
and/or other potential sources of enzymatic degradation
of proteins. Yet another alternative approach to prevent
or delay gastrointestinal absorption of protein drugs is
to incorporate them into a delivery system that is
designed to protect the protein from contact with the
proteolytic enzymes in the intestinal lumen and to
release the intact protein only upon reaching an area
favorable for its absorption. A more specific example of
this strategy is the use of biodegradable microcapsules
or microspheres, both to protect vulnerable drugs from
degradation, as well as to effect a prolonged release of
active drug (Deasy, in Microencapsulation and Related
Processes, Swarbrick, ed., Marcel Dekker, Inc.: New
York, 1984, pp. 1-60, 88-89, 208-11). Microcapsules also
can provide a useful way to effect a prolonged delivery
of a protein drug after injection (Maulding, J.
Controlled Release 6, 167-76, 1987).

The dose administered to an animal, such as a
mammal, particularly a human, in the context of the
present invention should be sufficient to effect a
therapeutic or prophylactic response in the individual
over a reasonable time frame. The dose will be
determined by the particular polypeptide, nucleic acid,
antibody, or anti-antibody administered, the severity of
any existing disease state, as well as the body weight
and age of the individual. The size of the dose also
will be determined by the existence of any adverse side
effects that may accompany the use of the particular
polypeptide, nucleic acid, antibody or anti-antibody
employed. It is always desirable, whenever possible, to
keep adverse side effects to a minimum.

The dosage can be in unit dosage form, such as a
tablet or capsule. The term "unit dosage form" as used
herein refers to physically discrete units suitable as
unitary dosages for human and animal subjects, each unit
containing a predetermined quantity of a vector, alone or
in combination with other active agents, calculated in an
amount sufficient to produce the desired effect in
association with a pharmaceutically acceptable diluent,
carrier, or vehicle. The specifications for the unit
dosage forms of the present invention depend on the
particular embodiment employed and the effect to be
achieved, as well as the pharmacodynamics associated with
each polypeptide, nucleic acid or anti-antibody in the
host. The dose administered should be an "HIV
infection-inhibiting amount" of an above-described
polypeptide or nucleic acid or an "immune
response-inducing effective amount" of an above-described
polypeptide, an above-described nucleic acid, or an
antibody as appropriate.

Another composition provided by the present
invention is a composition comprising a solid support
matrix to which is attached an above-described
polypeptide, or an anti-antibody to an above-described
polypeptide. The solid matrix can comprise other
functional reagents including, for example, polyethylene
glycol, dextran, albumin and the like, whose intended
effector functions may include one or more of the
following: to improve stability of the conjugate; to
increase the half-life of the conjugate; to increase
resistance of the conjugate to proteolysis; to decrease
the immunogenicity of the conjugate; to provide a means
to attach or immobilize a functional polypeptide or
anti-antibody onto a solid support matrix (see, for
example, Harris, in Poly(Ethylene Glycol) Chemistry:
Biotechnical and Biomedical Applications, Harris, ed.,
Plenum Press: New York (1992), pp. 1-14). Conjugates
furthermore may comprise a polypeptide or anti-antibody
coupled to an effector molecule, each of which,
optionally, may have different functions (e.g., such as a
toxin molecule (or an immunological reagent) and a
polyethylene glycol (or dextran or albumin) molecule).
Diverse applications and uses of functional proteins and
polypeptides, attached to or immobilized on a solid
support matrix, are exemplified more specifically for
poly(ethylene glycol) conjugated proteins or peptides in
a review by Holmberg et al. (In Poly(Ethylene Glycol)
Chemistry: Biotechnical and Biomedical Applications,
Harris, ed., Plenum Press: New York, 1992, pp. 303-324).

In addition, the present invention provides a method
of removing HIV from a bodily fluid of an animal. The
method comprises extracorporeally contacting the bodily
fluid of the animal with a solid-support matrix to which
is attached an above-described polypeptide or an
anti-antibody to an above-described polypeptide.
Alternatively, the bodily fluid can be contacted with the
polypeptide or anti-antibody in solution and then the
solution can be contacted with a solid support matrix to
which is attached a means to remove the polypeptide or
anti-antibody to which is bound HIV gp120 from the bodily
fluid.

Methods of attaching a herein-described
polypeptide, or an anti-antibody, to a solid support
matrix are known in the art. "Attached" is used herein
to refer to attachment to (or coupling to) and
immobilization in or on a solid support matrix. See, for
example, Harris, in Poly(Ethylene Glycol) Chemistry:
Biotechnical and Biomedical Applications, Harris, ed.,
Plenum Press: New York (1992), pp. 1-14, and
international patent application WO 91/02714 (Saxinger).
Diverse applications and uses of functional polypeptides
attached to or immobilized on a solid support matrix are
exemplified more specifically for poly(ethylene glycol)
conjugated proteins or peptides in a review by Holmberg
et al. (In Poly(Ethylene Glycol) Chemistry: Biotechnical
and Biomedical Applications, Harris, ed., Plenum Press:
New York, 1992, pp. 303-324).

The present invention also provides a method of
making an antibody that binds to gp120 of HIV under
physiological conditions. The method comprises labeling
an embodiment of the present inventive compound to obtain
a labeled compound. Labeling compounds is within the
skill of the ordinary artisan. For example, the present
inventive compound can be labeled with a radioactive atom,
such as radioactive iodine, in the same or a similar
manner as was performed in the examples provided below.
Alternatively, an enzyme, such as horseradish peroxidase,
can be attached to or incorporated into the present
inventive compound. Then by exposing a chromogenic or
photogenic compound to the compound, a signal indicative
of the presence and quantity of the compound present can be
generated. In another alternative, a polyhistidinyl
moiety can be attached to, or incorporated into, the
present inventive moiety so that the present inventive
compound will react with high affinity to transition
metal ions such as nickel, copper, or zinc ions; this
reaction can be used as the basis to quantify the amount
of the present inventive compound present at a particular
location. In yet another alternative, the present
inventive compound can be used as antigen to a standard
antibody that specifically recognizes an antigenic
epitope of the present inventive compound. As is
well-known, the standard antibody can itself be labeled or
used in conjunction with an additional antibody that is
labeled with an enzyme, radioisotope, or other suitable
means. The skilled artisan will recognize that there is
a plethora of other suitable means and methods to label
the present inventive compound.

This present inventive method of making an antibody
that binds to a gp120 envelope protein of HIV further
comprises providing a library of synthetic peptides. The
library consists of a multiplicity of
synthetically-produced polypeptides that are homologous,
and preferably essentially identical (i.e., having the
same primary amino acid residue sequence, ignoring
blocking groups, phosphorylation of serinyl, threoninyl,
and tyrosinyl residues, hydroxylation of prolinyl
residues, and the like) or identical, to a continuous
region of an HIV gp120 envelope protein. The polypeptides
of the library can be any suitable length. While larger
regions allow faster scanning and tend to preserve
non-linear epitopes, shorter length polypeptides allow
more sensitive screening of the primary sequence of the
gp120 protein. However, polypeptides that are too short
can lose essential secondary structure or cleave reactive
sites into one or more pieces. Preferably, a mixture of
short and long polypeptides is incorporated into the
library; however, the library can consist of polypeptides
of a single length (measured in amino acid residues). For
the sake of convenience the library can be split into
multiple parts, and screened by parts. Typically, the
polypeptides of the library will be between about 6 and
about 45 amino acid residues in length.

Typically, the library will comprise a series of
polypeptides each having an identical sequence to that of
gp120 but having an amino-terminus a particular number of
amino acids downstream of the amino-terminus of the prior
polypeptide (see examples section below). The distance,
measured in amino acid residues, is referred to as the
offset. Preferably, in libraries that are characterized
by the existence of an offset, the offset is not greater
than the product of the length of the longest polypeptide
measured in amino acid residues and 1.5, preferably 1.0,
and more preferably 0.5. The library can be
alternatively characterized by the existence of an offset
not greater than 30, preferably 15, and more
preferably 4.
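
The tiling of a sequence into overlapping peptides with a
fixed offset can be sketched in a few lines of Python. This
is an illustrative sketch only: the names tile_peptides and
offset_ratio are hypothetical, and the default length of 18
residues and offset of 4 are taken from the peptide tables in
the examples section, not from any requirement of the method.
A final fragment shorter than the window is kept, as in the
last row of each table.

```python
def tile_peptides(sequence, length=18, offset=4):
    """Split a sequence into overlapping peptides.

    Returns (start, end, peptide) tuples using 1-based residue
    numbering. Successive peptides start `offset` residues apart;
    the last peptide may be shorter than `length`.
    """
    if length <= 0 or offset <= 0:
        raise ValueError("length and offset must be positive")
    peptides = []
    start = 0
    while start < len(sequence):
        end = min(start + length, len(sequence))
        peptides.append((start + 1, end, sequence[start:end]))
        if end == len(sequence):
            break  # the sequence is fully covered
        start += offset
    return peptides


def offset_ratio(peptides, offset):
    """Offset divided by the length of the longest peptide."""
    longest = max(len(peptide) for _, _, peptide in peptides)
    return offset / longest


if __name__ == "__main__":
    # Carboxy-terminal 20 residues of CCR5 as listed in Example 1.
    library = tile_peptides("ERASSVYTRSTGEQEISVGL")
    for start, end, peptide in library:
        print(start, end, peptide)
    print(offset_ratio(library, 4) <= 0.5)
```

With 18-residue peptides and an offset of 4, the ratio of
offset to longest peptide is about 0.22, which falls within
the more preferred limit of 0.5 stated above.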

Each polypeptide of the library is substantially
isolated from every other polypeptide of said library and
is located in a known position. For example, each
polypeptide can be bound to a solid support that is
in a vessel or that can be placed in a vessel. The
vessel preferably enables each polypeptide to be covered
in a liquid that does not contact any other
polypeptide of the library. By way of example, each
polypeptide can be bound to a bead that is placed in a
vessel (or tube) or can be bound to the well of a
multi-well assay plate. Alternatively, an array of
polypeptides can be fashioned, for example on a microchip
device (as is presently used in some DNA sequencing
devices and methods), and the entire array can be bathed
in a single solution.
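
One way to keep each polypeptide at a known position is to
record which well of a multi-well plate holds it. The
following Python sketch assumes a 96-well plate (8 rows by
12 columns, the plate size used in the examples) filled in
row-major order. The fill order and the names well_label and
assign_wells are assumptions made for illustration; the text
does not prescribe a layout.

```python
import string


def well_label(index, n_rows=8, n_cols=12):
    """Map a 0-based index to a plate well label such as 'A1'."""
    if not 0 <= index < n_rows * n_cols:
        raise ValueError("index is outside the plate")
    row, col = divmod(index, n_cols)
    return f"{string.ascii_uppercase[row]}{col + 1}"


def assign_wells(peptides):
    """Return a dict mapping each well label to one peptide."""
    return {well_label(i): peptide for i, peptide in enumerate(peptides)}


if __name__ == "__main__":
    layout = assign_wells(["CYSGILKTLLRCRNEKKR", "ILKTLLRCRNEKKRHRAV"])
    for well, peptide in layout.items():
        print(well, peptide)
```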

Each polypeptide is then individually contacted with
the labeled compound such that a portion of the labeled
compound can bind with the polypeptide of the library.
In this way, a bound population of the labeled compound
of the present invention and an unbound population of the
labeled compound are generated. The phrase "individually
contacted" means that each polypeptide has the opportunity
to bind with the labeled compound and the quantity of
labeled compound bound by each can be determined.

The method then comprises removing substantially all
of the unbound labeled compound from the position
occupied by each polypeptide. That is, the solution
comprising the labeled compound is separated from the
polypeptides of the library and the bound population of
the labeled compound. This can be done by any suitable
method, e.g., by aspiration and one or more washing steps
comprising adding a quantity of liquid sufficient to
cover all the surfaces that were contacted by the labeled
compound and aspirating away substantially all of the
wash liquid.

The amount of labeled compound that remains
co-localized with each polypeptide of the library is then
measured to determine the quantity of labeled compound
bound by each polypeptide. The amount of the present
inventive compound bound by each polypeptide can be
directly evaluated to identify a portion of the HIV gp120
envelope protein that binds to an (HIV)-receptor selected
from the group consisting of CCR5, CXCR4, STRL33, and
CD4. This information is then used to identify and
provide an immunizing compound. The immunizing compound
comprises a polypeptide comprising an amino acid sequence
that is homologous to, or preferably is essentially
identical to, or identical to, the portion of the HIV-1
gp120 envelope protein that binds with CD4, CCR5, CXCR4,
and/or STRL33. The immunizing protein can be provided by
processing gp120, e.g., proteolytically digesting gp120
that has been isolated from a preparation of HIV-1.
Preferably, however, the immunizing compound is prepared
synthetically, or by genetic engineering, or by a
combination of genetic engineering and synthetic methods.
The immunizing compound can comprise a pharmaceutically
acceptable substituent, can be encoded by a nucleic acid
that can be expressed in a cell, can be mixed with a
carrier, and is an inventive aspect of the present
invention.

An immunogenic quantity of the immunizing compound
is then inserted into an animal (e.g., a human, or a
rodent, a canine, a feline, or a ruminant) in a manner
consistent with the discussion of a method of raising an
antibody to the present inventive compounds that are
homologous to portions of CCR5, CXCR4, STRL33, and CD4,
above. The insertion of the immunizing compound causes
the inoculated animal to produce an antibody that binds
with said portion of the HIV gp120 envelope protein.
Thus the present invention also provides an antibody that
binds to an HIV gp120 envelope protein, as well as an
antigen binding protein comprising one or more
complementarity determining regions of the antibody
(e.g., a Fab, a Fab', an Fv, a single-chain antibody, a
diabody, and humanized variants of all of the above, all
of which are within the skill in the art).

The antibody or variant thereof is preferably useful
in detecting or diagnosing the presence of HIV gp120
envelope protein, and thus HIV, in an animal. The
antibody also preferably prevents or attenuates
infection of an animal exposed to HIV, to whom an
effective quantity of the antibody or a variant thereof
has been administered or in whom it has been produced in
response to inoculation with the immunizing compound. The
antibody preferably also is useful in treating or
preventing (i.e., inhibiting) HIV infection in an animal
to whom a suitable dose has been administered or in which
a suitable quantity of antibody has been produced. The
antibody is also useful in the study of HIV infection of
mammalian cells, the host range specificities of HIV
infection, and preferably, the mechanism by which
antibodies neutralize infectious viruses.

EXAMPLES 

The following examples further illustrate the
present invention but, of course, should not be construed
as limiting the scope of the claimed invention in any
way.

Synthetic peptide arrays were constructed in 96-well
microtiter plates in accordance with the method set forth
in WO 91/02714 (Saxinger), and used to test the binding
of HIV-1 envelope gp120 that had been labeled with
radioactive iodine (radiolabeling by standard methods).
After incubating the radiolabeled gp120 in a well with
each synthetic peptide, a washing step was performed to
remove unbound label, and the relative level of
radioactivity remaining in each well of the plate was
evaluated to determine the relative affinity of each
peptide for the gp120. The synthesis of the peptides and
the quantity of binding between the synthetic peptides
and the gp120 were found to be suitably reproducible,
precise, and sensitive. Initial screening of the entire
primary sequence of the chemokine and CD4 receptor
molecules was taken 18 amino acid residues at a time.

The authenticity of the binding signals generated by
this technique has been repeatedly demonstrated by
showing that antibodies to CCR5 and CXCR4 are able to
inhibit the binding of radiolabeled gp120 to the
polypeptides derived from CCR5 and CXCR4 that show a high
affinity for binding with gp120. Additionally, the
accuracy of the binding assay used hereinbelow is
demonstrated by Example 7.

Example 1 

This example identifies segments of the CCR5
co-receptor that bind with gp120.

The first column in the table below indicates the
number of the amino acid in the wild-type CCR5 receptor.
The second column explicitly identifies the peptide
sequence. The third column indicates the radioactive
counts recorded in twenty minutes (i.e., the cpm x 20)
after the background or non-specific counts had been
subtracted. The fourth column contains an X in each row
for which the listed polypeptide bound with high affinity
to gp120. The fifth and final column contains an X in
each row wherein the listed sequence binds with
substantial affinity but is weak in comparison to other
samples, particularly adjacent samples.

Position  Peptide             cpm x 20
213-230   CYSGILKTLLRCRNEKKR        70
217-234   ILKTLLRCRNEKKRHRAV        19
221-238   LLRCRNEKKRHRAVRLIF       102
225-242   RNEKKRHRAVRLIFTIMI        23
229-246   KRHRAVRLIFTIMIVYFL        36
233-250   AVRLIFTIMIVYFLFWAP        62
237-254   IFTIMIVYFLFWAPYNIV       121
241-258   MIVYFLFWAPYNIVLLLN       214
245-262   FLFWAPYNIVLLLNTFQE       616
249-266   APYNIVLLLNTFQEFFGL      1962
253-270   IVLLLNTFQEFFGLNNCS      2134
257-274   LNTFQEFFGLNNCSSSNR       293
261-278   QEFFGLNNCSSSNRLDQA        63
265-282   GLNNCSSSNRLDQAMQVT       -31
269-286   CSSSNRLDQAMQVTETLG        90
273-290   NRLDQAMQVTETLGMTHC        10
277-294   QAMQVTETLGMTHCCINP        81
281-298   VTETLGMTHCCINPIIYA        15
285-302   LGMTHCCINPIIYAFVGE       282
289-306   HCCINPIIYAFVGEKFRN       200
293-310   NPIIYAFVGEKFRNYLLV       162
297-314   YAFVGEKFRNYLLVFFQK       596
301-318   GEKFRNYLLVFFQKHIAK        69
305-322   RNYLLVFFQKHIAKRFCK        65
309-326   LVFFQKHIAKRFCKCCSI        76
313-330   QKHIAKRFCKCCSIFQQE        23
317-334   AKRFCKCCSIFQQEAPER        64
321-338   CKCCSIFQQEAPERASSV        53
325-342   SIFQQEAPERASSVYTRS       100
329-346   QEAPERASSVYTRSTGEQ        84
333-350   ERASSVYTRSTGEQEISV        84
337-352   SVYTRSTGEQEISVGL          47

These data indicate that, in addition to polypeptide
sequences derived from positions 1-18 of the CCR5
receptor, the polypeptide sequences LPPLYSLVFIFGFVGISIML,
QWDFGNTMCQLLTGLYFIGFFS, SQYQFWKNFQTLKIVILG,
APYNIVLLLNTFQEFFGLNNCS, and YAFVGEKFRNYLLVFFQK comprise
multiple subsequences, each of which is capable of binding
to HIV-1 envelope gp120.
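
The way adjacent overlapping peptides combine into one longer
binding segment can be illustrated with a short Python
sketch. It merges overlapping peptides whose counts meet a
cutoff into a single contiguous segment. The cutoff of 1000
counts and the name merge_binding_windows are hypothetical,
chosen only for illustration; the text does not state a
numerical criterion for a peak.

```python
def merge_binding_windows(rows, threshold):
    """Merge overlapping peptides with counts >= threshold.

    `rows` is a list of (start, peptide, counts) tuples sorted by
    start, with 1-based start positions. Returns a list of
    (start, end, sequence) tuples, one per contiguous segment.
    """
    regions = []
    for start, peptide, counts in rows:
        if counts < threshold:
            continue
        end = start + len(peptide) - 1
        if regions and start <= regions[-1][1] + 1:
            prev_start, prev_end, prev_seq = regions[-1]
            if end > prev_end:
                # Append only the residues beyond the previous end.
                prev_seq += peptide[prev_end - start + 1:]
                regions[-1] = (prev_start, end, prev_seq)
        else:
            regions.append((start, end, peptide))
    return regions


if __name__ == "__main__":
    # Four consecutive rows from the CCR5 table in this example.
    rows = [
        (245, "FLFWAPYNIVLLLNTFQE", 616),
        (249, "APYNIVLLLNTFQEFFGL", 1962),
        (253, "IVLLLNTFQEFFGLNNCS", 2134),
        (257, "LNTFQEFFGLNNCSSSNR", 293),
    ]
    print(merge_binding_windows(rows, threshold=1000))
```

With this cutoff, the rows at positions 249 and 253 merge
into the 22-residue segment APYNIVLLLNTFQEFFGLNNCS, one of
the sequences named in the preceding paragraph. A single
fixed cutoff is a simplification; a peak judged relative to
its neighbors would need a different rule.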

Example 2 

This example identifies segments of the CXCR4
co-receptor that bind with gp120.

• The first column in the table below indicates the 
number of the amino acid in the wild- type CXCR4 receptor. 
The second column explicitly identifies the peptide 
sequence • The third and fourth columns indicate the 
radioactive counts recorded in twenty minutes (i.e., the 
cpm X 20) after the background or non-specific counts had 
been subtracted. The fifth column contains an X in each 
row for which the listed polypeptide bound with high 
affinity to gpl20. The sixth and final column contains 
an X in each row wherein the listed sequence binds with 
substantial affinity but is weak in comparison to other 
samples, particularly adjacent samples. 
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. SEQ SEG PEPTIDE Ma j or Minor 

Activity Activity 
Peak Peak 





enrpty v conurojL ; 


*!• iZ 


U 


1— 18 


MEGX S Xi ToiJrj x 1 h»i!ilYlvaiD 


oUUo 


2591 


5--22 


S I YTSDNY TEEMGooLJ x JJ 


4oo 


71 


9 — 26 


SDNYTEEMGSGDYDSMKB 


455 


43 


13-30 


TEEMGSGDYDSMKEPCFR 


453 


41 


17-34 


GSGDYDSMKEPCFREENA 


384 


-28 


21-38 


YDSMKEPCFREENANFNK 


466 


53 


25-42 


KEPCFRBENANFNKIFLP 


664 


252 


29-46 


FREBNANFNKIFLPTIYS 


463 


51 


33-50 


NANFNKIFLPTIYS I IFIi 


585 


173 


37-54 


NKIFLPTIYSIIFLTGIV 


550 


138 


41-58 


LPTI YS I IFLTGIVGNGIi 


530 


118 


45-62 


YS I IFLTGIVGNGLVILV 


535 


123 


49-66 


FLTGIVGNGLVILVMGYQ 


658 


246 


53-70 


IVGNGLVILVMGYQKKLR 


650 


238 


57-74 


GLVIIiVMGYQKKLRSMTD 


569 


157 


61-78 


LVMGYQiCKLRSMTDKYRli 


517 


105 


65-82 


YQKKLRSMTDKYRIiHLSV 


511 


99 


69-86 


LRSMTDKYRLHLSVADLL 


572 


160 


73-90 


TDKYRXjHIiSVADLLFVIT 


504 


92 


77-94 


RLHLSVADLLFVITLPFW 


548 


136 


81-98 


SVADLLFVITLPFWAVDA 


665 


253 


85-102 


LLFVI TLPFWAVDAVMTW 


475 


63 


89-106 


ITLPFWAVDAVANWYFGN 


542 


130 


93-110 


FWAVDAVANWYFGNFIiCK 


478 


66 


97-114 


DAVANWYFGNFLCKAVHV 


524 


112 


101-118 


NWYFGNFLCKAVHVXYTV 


oOo 


96 


105-122 


QNJ? XiCi\AVil V JL X 1 ViNJj X o 






109-126 


CKAVHVI YTVNLYS S VLI 


655 


243 


113-130 


HVIYTVNLYSSVLILAFI 


530 


118 


117-134 


TVNLYSSVLILAFISLDR 


654 


242 


121-138 


YSSVLILAFISLDRYLAI 


569 


167 


125-142 


LILAFISLDRYLAIVHAT 


519 


107 


129-146 


FISLDRYLAIVHATNSQR 


503 


91 


133-150 


DRYLAIVHATNSQRPRKL 


580 


168 


137-154 


AIVHATNSQRPRKLLAEK 


485 


73 


141-158 


ATNSQRPRKLLAEKWYV 


490 


78 


145-162 


QRPRKLLAEKVVYVGVWI 


539 


127 
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m 



149- 


166 


KLLAEKWYVGVWI PALL 


501 


89 


153- 


170 


EKWYVGVWIPALLLTIP 


559 


147 


157- 


174 


YVGVWI PALLLTI PDFIF 


536 


124 


161- 


178 


WI PALLLTI PDFI FANVS 


594 


182 


165- 


182 


LLLTIPDFIFANVSEADD 


1418 


1006 


169- 


186 


IPDFIFANVSEADDRYIC 


850 


438 


173- 


190 


IFANVSEADDRYICaDRFY 


679 


267 


177- 


•194 


VSEADDRYICDRFYPNDL 


569 


157 


181- 


•198 


DDRYICDRFYPNDLWVW 


537 


125 


185- 


•202 


ICDRFYPNDLWVWFQFQ 


718 


306 


189- 


■206 


FYPNDLWWVFQFQHIMV 


828 


416 


193- 


•210 


DLWWVFQFQHIMVCTiIL 


834 


422 


197- 


•214 


WFQFQHIMVGliILPGIV 


1001 


589 


201- 


-218 


FQHIMVGLILPGIVILSC 


582 


170 


205- 


-222 


MVGLILPGIVILSCYCII 


579 


167 


209- 


-226 


ILPGIVII.SCYCI I ISKL 


604 


192 


213- 


•230 


IVILSCYCI 1 1 SKLSHSK 


689 


277 


217- 


■234 


SCYCI I ISKLSHSKCfflQK 


671 


259 


221- 


-238 


IIISKLSHSKGHQKRKMi 


569 


157 


225- 


-242 


KLSHSKGHQKRKALKTTV 


542 


130 


229- 


-246 


SKGHQKRKALKTTVILIL 


552 


140 


233- 


■250 


QKRKAIiKTTVILILAFFA 


695 


283 


237- 


-254 


ALKTTVILILAFFACWIiP 


673 


261 


241- 


-258 


TVILILAFFACWLPYYIG 


735 


323 


245- 


-262 


ILAFFACWLPYYIGISID 


596 


184 


249- 


-266 


FACWliPYYIGISIDSFIL 


614 


202 


253- 


-270 


LPYYIGISIDSFILLEII 


851 


439 


257- 


-274 


IQISIDSFILLEIIKQGC 


1146 


734 


261- 


-278 


IDSFILLEI IKQGCEFEN 


3884 


3472 


265- 


-282 


ILLEIIKQGCEFENTVHK 


529 


117 


269- 


-286 


I IKQGCEFENTVHKWI S I 


518 


106 


273- 


-290 


GCEFENTVHKWISITEAL 


676 


264 


277 


-294 


ENTVHKWISITEALAFFH 


727 


315 


-281 


-298 


HKWISITEALAFFHCCLN . 


575 


163 


285 


-302 


SITBALAFFHCdjNPILY 


600 


188 


289 


-306 


AIAFFHCCLNPILYAFLG 


593 


181 


293 


-310 


FHCCLNPILYAFLGAKFK 


535 


123 


297 


-314 


LNPILYAFLGAKFKTSAQ 


686 


274 


301 


-318 


LYAFLQAKFKTSAQHALT 


568 


156 


305 


-322 


LGAKFKTSAQHALTSVSR 


612 


200 


309 


-326 


FKTSAQHftliTSVSRGSSL 


585 


173 


313 


-330 


AQHALTSVSRGSSLKILS 


559 


147 
317


-334 


LTSVSR6SSLKILSK6KR 


595 


183 


321 


-338 


SRGSSLKILSK6KRGGHS 


581 


169 


325 


-342 


SLKILSKGKRGGHSSVST 


697 


285 


329 


-346 


LSKGKRGGHSSVSTESES 


597 


185 


333 


-350 


KRGGHSSVSTESESSSFH 


579 


167 


337 


-352 


HSSVSTKSESSSFHSS 


515 


103 



These data indicate that, in addition to polypeptide
sequences derived from positions 1-18 of the CXCR4
receptor, the polypeptide sequences LLLTIPDFIFANVSEADD
(165-182), VVFQFQHIMVGLILPGIV (197-214), and
IDSFILLEIIKQGCEFEN (261-278) comprise multiple
subsequences that are capable of binding to HIV-1
envelope gp120.

Example 3

This example identifies segments of the STRL33
co-receptor that bind with gp120.

The first column in the table below indicates the
positions of the amino acids in the wild-type STRL33
receptor. The second column explicitly identifies the
peptide sequence. The third and fourth columns indicate
the radioactive counts recorded in twenty minutes (i.e.,
the cpm x 20) after the background or non-specific counts
had been subtracted. The fifth column contains an X in
each row for which the listed polypeptide bound with high
affinity to gp120. The sixth and final column contains
an X in each row wherein the listed sequence binds with
substantial affinity but is weak in comparison to other
samples, particularly adjacent samples.
SEQ SEG    PEPTIDE    Major Activity Peak    Minor Activity Peak





emntv ( control ) 


-34.5 


34.5 






T T Q 


MAEHDYHEDYGFSSFNDS 


1178.5 


1320.5 




X 


o 


DYHED YGFS SFNDSSQEE 


3357.5 


3689.5 




X 




nVGFS S PNDS SOEEHQAP 


8579.5 


8909.5 


X 




13-30 


Q c! Tnon g SOEEHOAFLOPS 


2689.5 


2757.5 




X 


17-34 


TtG QmPTTTTn AFTjOPSICVFIj 


869.5 


2152.5 




X 


21-38 




2316.5 


1819.5 




X 


25-42 


AJP JUyi? toJ^V I?iJjrV*lXlXJUV 


1421.5 


1359.5 




X 


29-46 


FSKVFXir'L.WzijV VJ? vv^wu 


534.5 


633.5 




• 


33-50 


FXiPCM i i-iV V J? V (^varXi V vsfJN O 


605.5 


372.5 






37-54 


MxL Vv FvCGXi V v»JN 0 JU V Jj V 


168.5 


235.5 






41-58 


VFVCGLVGNSIjVXiV JL fair 




284.5 






45-62 


GLVGNSLVLVI S I FYHKL 










49-66 


NS IiVLVI S I FYHIUjQS li I 




137a 5 




V 


53-70 


LVISIFYHKIjQSLTDVFXj 




t / 






57-74 


IFYHKLQSLTDVFIjVNXjJE? 




1256 5 




■V 


61-78 


KLQSIiTD VF LiVNijFijAJJJLj 




348.5 






65-82 


LTDVFLVNXjJrJaAJJij V r v u 


233.5 


336.5 






69-86 


FIjvWIjPIxAJuijVr VCXljr'i? 


70.5 


51.5 






73-90 


XiPXjiUjIjVr VV-i JLiJr 1? WAX A 


557.5 


960.5 




X 


77-94 


i-\T TTTPrrr'T'T.'DtrKrAVAnTHE 


1116.6 


1063.5 




X 


81-98 




1819.5 


1754.5 




X 


-85-102 


TDiTwav A riT WVFGOVMC 

ir J? WAXAwJuXiO mi V J. v^Sc * * ■'^ 


7262.5 


7537.5 


X 




89-106 


V7i.r« T WT71HfV"R'finVMCKSIjIj 


5911.5 


6245.5 




X 


93-110 


tnrwvpi'r^nvMCKSIjIjGIYT 


3391.5 


3466.5 




X 


97-114 


TpnnvMPTTQTjIjGTYTINFY 


1257.5 


1354.5 




X 


101-XXo 


MCKSIiIiGIYTINFYTSML 


1505.6 


1283.5 








LLGIYTINFYTSMLILTC 


499.5 


408.5 






109-126 


YTINFYTSMLILTCITVD 


351.5 


510.5 






113-130 


FYTSMLILTCITVDRFIV. 


744.5 


907.5 






117-134 


MLILTCITVDRFIVWKA 


298.5 


228.5 






121-138 


TCITVDRFIVWKATKAY 


89.5 


346.5 






125-142 


VDRFIVWKATKAYNQQA 


103.5 


53.5 






129-146 


IWVKATKAYNQQAKRMT 


166.5 


43,5 






133-150 


KATKAYNQQAKRMTWGKV 


701.5 


568.5 






137-154 


AYNQQAKRMTWGKVTSLL 


55.5 


4.5 






141-158 


QAKRMTWGKVTSLLIWVI 


-71.5 


-31.5 






145-162 


MTWGKVTSLLIWVISLLV 


-0.5 


-26.5 
149


-166 


KVTSLLIWISLLVSLPQ 


'39A 


5 -118.5 


153 


-170 


LLIWISLLVSLPQIIYG 


42.i 


5 75.5 


157 


-174 


VISLLVSLPQI I YGNVFN 


-60.i 


5 -127,5 


161 


-178 


LVSLPQI I YGNVFNLDKL 


91 i 


5 -15.5 


165 


-182 


PQI I YGNVFNLDKLICGY 


-18.£ 


5 -37.5 


169 


-186 


YGNVFNIiDiCLICGYHDEA 


-AM 


i -20.5 


173 


-190 


FNLDKLICGYHDEAISTV 


1072i 


) 1078.5 


177 


-194 ' 


KLICGYHDEAISTVVIiAT 


1363.£ 


1 1604.5 


181 


-198 


GYHDEAI STWLATQMTL 


754.£ 


> 1181.5 


185 


'202 


EAI STWIiATQMTLGFFL 


3973.5 


3745.5 


189 


-205 


TWLATQMTLGPFLPLLT 


2327.5 


2389.5 


193 


-210 


ATOMTLGFFLPLLTMIVC 


2365.5 


2444.5 


197 


-214 


TLGFPLPLLTMIVCYSVI 


2387.5 


479.5 


201 


-91ft 

^ -1. w 


PliPIiLTMIVCYSVI IKTIj 


1270.5 


1195.5 


Jit \J ^ 


-999 




2787.5 




209- 


-22S 


VCYSVX IKTIiLHAGGFOK 


1334.5 


1 143.5 




-9'^n 


V X X XvX XJXJAlXXwVJ^iJ ^£VnZv*^XJ 


961.5 




217- 


-9*^4 


X xixJXXiniv7Vjrj? ^xuixvwXJJiVx xf 


1041 5 






o 


artf:iTnmrH"P .tct t ft .vma 

^^varvjf ^xvxiXvOJij^x x f xi v 








-949 


^£VJnXv.OJLJX^JL X J? XJ V l*XxaV X J-txJ 




O \ *f,0 


99 Q- 


-9 


Oi_i JXx X f V vun^yi s iJXi X ^1*2 Jr 






^ w O 




T PTjVM AVPTjT jTOMPFMIiM 

Ju .C JLJ V i*lf^ V E 1 til 1 X ^1*1 JTf XVXJJ74 


386.5 


779 




.9 PA 


M A VPT .T .TnMP ITNrT .MTTP T P 
l^Ln^V f Ji.H 1 X \gjSr\.S S J.^xjtTlJXf X^ 






941 - 




T iTiTnMPPNTiM'K'P T RSTE?W 

LiIJ 1 X \gy»^Xi S ii%XM?kCSM X C^Ji^ X AvU 


2513.5 




^ ^ 3 - 






2171.5 


91ft9 


249- 


A o u 


LMKFXRSTHWEYYAMTS F 

JmJk'*Ji>m^ X<VVtJ XA^WXaX X«6^* 1 JU X 


934.5 


949.5 


253- 


•270 


I RS THWEYYAMTS FHYTI 


1571.5 


1807.5 


257- 


'274 


HWE YYAMTS FHYTIMVTE 


2040.5 


3066.5 


2S1- 


■278 


YAMTSFHYTIMVTEAIAY 


2688.5 


2359.5 


265- 


■282 


S FHYTIMVTEAI AYLRAC 


761.5 


1033.5 


269- 


286 


TIMVTEAIAYIiRACLNPV 


140.5 


272.5 


273- 


290 


TEAIAYLRACLNPVLYAF 


604.5 


480.5 


277" 


294 


AYLRACLNPVLYAFVSLK 


1802.5 


1849.5 


281- 


298 


ACLNPVLYAFVSLKFRKN 


4173.5 


4515.5 


285- 


302 


PVLYAFVSLKFRKNFWKL 


1859,5 


2147.5 


289- 


306 


AFVSLKFRKNFWKLVKDI 


808.5 


1040.5 


293- 


310 


LKFRKNFWKLVKDIGCLP 


920.5 


957.5 


297- 


314 


KNFWKLVKDIGCLPYLGV 


143.5 


82.5 


301" 


318 


KLVKDIGCLPYLGVSHQW 


-2.5 


27.5 


305- 


322 


DIGCLPYLGVSHQWKSSE 


17.5 


78.5 


309- 


326 


LPYLGVSHQWKSSEDNSK 


111.5 


122.5 


313- 


330 


GVSHQWKSSEDNSKTFSA 


208.5 


306.5 
317


-334 


QWKSSEDNSKTFSASHNV 


464.5 


533.5 


321 


-338 


SEDHSKTFSASHNVEATS 


524.5 


434.5 


325 


-342 


SKTFSASHNVEATSMFQL 


1524.5 


1239.5 



These data indicate that, in addition to polypeptide
sequences derived from positions 9-26 of the STRL33
receptor, the polypeptide sequences LVISIFYHKLQSLTDVFL
(53-70), PFWAYAGIHEWVFGQVMC (85-102), EAISTVVLATQMTLGFFL
(185-202), LTMIVCYSVIIKTLLHAG (205-222),
MAVFLLTQMPFNLMKFIRSTHW (237-258), HWEYYAMTSFHYTIMVTE
(257-274), ACLNPVLYAFVSLKFRKN (281-298) and
SKTFSASHNVEATSMFQL (325-342) comprise multiple
subsequences that are capable of binding to HIV-1
envelope gp120.

Example 4 

This example identifies segments of the human CD4 
protein that bind with gpl20. 

The second column in the table below
identifies the amino acid residue sequence of the
polypeptide employed in the assay. The first column
identifies the sequence coordinates of human CD4 that
have an identical amino acid sequence. The third column
indicates the number of radioactive decays (i.e., counts)
that were counted, which is indicative of the affinity of
the synthetic polypeptide for the gp120 protein. In the
table below, polypeptides retaining more than 4,000
counts identify fragments that have a substantial
capability to bind with gp120. Polypeptides retaining
more than 6,000 counts have more substantial binding
affinity. Polypeptides retaining at least about 10,000
counts have a substantial and strong capacity to bind to
gp120. Of course, fragments corresponding to amino acid
coordinates 101-121 and 106-126 have a substantial,
strong, and dominant capacity to bind to gp120.
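
The count thresholds described above can be read as a simple tiering rule. The following Python sketch is illustrative only: the function name `classify_binding` and the tier labels are hypothetical and not part of the original disclosure, and treating "at least about 10,000" as an exact cutoff of 10,000 is an assumption made for the sake of the example.

```python
def classify_binding(counts):
    """Return a qualitative binding tier for a retained-count value.

    The cutoffs (4,000; 6,000; 10,000) come from the text above; using
    them as exact boundaries is an assumption of this sketch.
    """
    if counts >= 10_000:
        return "substantial and strong"
    if counts > 6_000:
        return "more substantial"
    if counts > 4_000:
        return "substantial"
    return "below threshold"


# Counts for a few CD4 fragments, taken from the table in this example.
fragments = {"1-21": 3587, "6-26": 4356, "96-116": 8439, "106-126": 44965}
tiers = {coords: classify_binding(n) for coords, n in fragments.items()}
for coords, tier in tiers.items():
    print(coords, fragments[coords], tier)
```

Applying the same function to every row of the table gives a quick way to list the candidate binding regions before inspecting neighboring, overlapping fragments.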



Well   No.    Coordinates   Peptide sequence          Counts

B1     (1)    1-21          MNRGVPFRHLLLVLQLALLPA     3587
C1     (2)    6-26          PFRHLLLVLQLALLPAATQGK     4356
D1     (3)    11-31         LLVLQLALLPAATQGKKVVLG     1785
E1     (4)    16-36         LALLPAATQGKKVVLGKKGDT     1759
F1     (5)    21-41         AATQGKKVVLGKKGDTVELTC     1562
G1     (6)    26-46         KKVVLGKKGDTVELTCTASQK     1910
H1     (7)    31-51         GKKGDTVELTCTASQKKSIQF     1831
A2     (8)    36-56         TVELTCTASQKKSIQFHWKNS     1732
B2     (9)    41-61         CTASQKKSIQFHWKNSNQIKI     1717
C2     (10)   46-66         KKSIQFHWKNSNQIKILGNQG     2182
D2     (11)   51-71         FHWKNSNQIKILGNQGSFLTK     1835
E2     (12)   56-76         SNQIKILGNQGSFLTKGPSKL     1487
F2     (13)   61-81         ILGNQGSFLTKGPSKLNDRAD     1467
G2     (14)   66-86         GSFLTKGPSKLNDRADSRRSL     1844
H2     (15)   71-91         KGPSKLNDRADSRRSLWDQGN     1912
A3     (16)   76-96         LNDRADSRRSLWDQGNFPLII     1753
B3     (17)   81-101        DSRRSLWDQGNFPLIIKNLKI     2224
C3     (18)   86-106        LWDQGNFPLIIKNLKIEDSDT     3264
D3     (19)   91-111        NFPLIIKNLKIEDSDTYICEV     11646
E3     (20)   96-116        IKNLKIEDSDTYICEVEDQKE     8439
F3     (21)   101-121       IEDSDTYICEVEDQKEEVQLL     6803
G3     (22)   106-126       TYICEVEDQKEEVQLLVFGLT     44965
H3     (23)   111-131       VEDQKEEVQLLVFGLTANSDT     36249
A4     (24)   116-136       EEVQLLVFGLTANSDTHLLQG     14171
B4     (25)   121-141       LVFGLTANSDTHLLQGQSLTL     3683
C4     (26)   126-146       TANSDTHLLQGQSLTLTLESP     6114
D4     (27)   131-151       THLLQGQSLTLTLESPPGSSP     2552
E4     (28)   136-156       GQSLTLTLESPPGSSPSVQCR     1538
F4     (29)   141-161       LTLESPPGSSPSVQCRSPRGK     1476
G4     (30)   146-166       PPGSSPSVQCRSPRGKNIQGG     1496
H4     (31)   151-171       PSVQCRSPRGKNIQGGKTLSV     1400
A5     (32)   156-176       RSPRGKNIQGGKTLSVSQLEL     2066
B5     (33)   161-181       KNIQGGKTLSVSQLELQDSGT     3078
C5     (34)   166-186       GKTLSVSQLELQDSGTWTCTV     2618
D5     (35)   171-191       VSQLELQDSGTWTCTVLQNQK     3879
E5     (36)   176-196       LQDSGTWTCTVLQNQKKVEFK     2456
F5     (37)   181-201       TWTCTVLQNQKKVEFKIDIVV     4030
G5     (38)   186-206       VLQNQKKVEFKIDIVVLAFQK     9737
H5     (39)   191-211       KKVEFKIDIVVLAFQKASSIV     6313
A6     (40)   196-216       KIDIVVLAFQKASSIVYKKEG     3681
B6     (41)   201-221       VLAFQKASSIVYKKEGEQVEF     3566
C6     (42)   206-226       KASSIVYKKEGEQVEFSFPLA     14347
D6     (43)   211-231       VYKKEGEQVEFSFPLAFTVEK     14740
E6     (44)   216-236       GEQVEFSFPLAFTVEKLTGSG     18549
F6     (45)   221-241       FSFPLAFTVEKLTGSGELWWQ     9673
G6     (46)   226-246       AFTVEKLTGSGELWWQAERAS     3992
H6     (47)   231-251       KLTGSGELWWQAERASSSKSW     1878
A7     (48)   236-256       GELWWQAERASSSKSWITFDL     2730
B7     (49)   241-261       QAERASSSKSWITFDLKNKEV     2588
C7     (50)   246-266       SSSKSWITFDLKNKEVSVKRV     1761
D7     (51)   251-271       WITFDLKNKEVSVKRVTQDPK     2126
E7     (52)   256-276       LKNKEVSVKRVTQDPKLQMGK     2288
F7     (53)   261-281       VSVKRVTQDPKLQMGKKLPLH     1848
G7     (54)   266-286       VTQDPKLQMGKKLPLHLTLPQ     2075
H7     (55)   271-291       KLQMGKKLPLHLTLPQALPQY     1949
A8     (56)   276-296       KKLPLHLTLPQALPQYAGSGN     1922
B8     (57)   281-301       HLTLPQALPQYAGSGNLTLAL     2394
C8     (58)   286-306       QALPQYAGSGNLTLALEAKTG     2364
D8     (59)   291-311       YAGSGNLTLALEAKTGKLHQE     1830
E8     (60)   296-316       NLTLALEAKTGKLHQEVNLVV     1676
F8     (61)   301-321       LEAKTGKLHQEVNLVVMRATQ     1729
G8     (62)   306-326       GKLHQEVNLVVMRATQLQKNL     1776
H8     (63)   311-331       EVNLVVMRATQLQKNLTCEVW     2183
A9     (64)   316-336       VMRATQLQKNLTCEVWGPTSP     2144
B9     (65)   321-341       QLQKNLTCEVWGPTSPKLMLS     1856
C9     (66)   326-346       LTCEVWGPTSPKLMLSLKLEN     2412
D9     (67)   331-351       WGPTSPKLMLSLKLENKEAKV     2414
E9     (68)   336-356       PKLMLSLKLENKEAKVSKREK     1656
F9     (69)   341-361       SLKLENKEAKVSKREKAVWVL     1663
G9     (70)   346-366       NKEAKVSKREKAVWVLNPEAG     1735
H9     (71)   351-371       VSKREKAVWVLNPEAGMWQCL     2034
A10    (72)   356-376       KAVWVLNPEAGMWQCLLSDSG     3133
B10    (73)   361-381       LNPEAGMWQCLLSDSGQVLLE     6316
C10    (74)   366-386       GMWQCLLSDSGQVLLESNIKV     4185
D10    (75)   371-391       LLSDSGQVLLESNIKVLPTWS     2375
E10    (76)   376-396       GQVLLESNIKVLPTWSTPVQP     2089
F10    (77)   381-401       ESNIKVLPTWSTPVQPMALIV     1992
G10    (78)   386-406       VLPTWSTPVQPMALIVLGGVA     2197
H10    (79)   391-411       STPVQPMALIVLGGVAGLLLF     2527
A11    (80)   396-416       PMALIVLGGVAGLLLFIGLGI     3067
B11    (81)   401-421       VLGGVAGLLLFIGLGIFFCVR     3738
C11    (82)   406-426       AGLLLFIGLGIFFCVRCRHRR     2099
D11    (83)   411-431       FIGLGIFFCVRCRHRRRQAER     1900
E11    (84)   416-436       IFFCVRCRHRRRQAERMSQIK     2085
F11    (85)   421-441       RCRHRRRQAERMSQIKRLLSE     2075
G11    (86)   426-446       RRQAERMSQIKRLLSEKKTCQ     1607
H11    (87)   431-451       RMSQIKRLLSEKKTCQCPHRF     2020
A12    (88)   436-456       KRLLSEKKTCQCPHRFQKTCS     1674
B12    (89)   441-458       EKKTCQCPHRFQKTCSPI        2006
A1     (0)                  empty (control)           2075



Example 5

This example shows the binding of 125I-HIV-1 LAI gp120
to the amino termini of CCR5, CXCR4, and STRL33 as a
function of position and length.
Synthetic peptide arrays of nonapeptides, dodecapeptides,
pentadecapeptides and octadecapeptides derived from the CCR5
(panel A), CXCR4 (panel B) and STRL33 (panel C) amino
terminal domains were prepared and utilized to test the
binding of 125I-HIV-1 LAI envelope gp120. Ordinal sequence
position numbers are given in accordance with the
sequence data provided by the Genbank database for CCR5
(accession No. g1457946, gi|1457946), CXCR4 (accession
No. g539677, gi|400654, sp|P30991) and STRL33 (accession
No. g2209288, gi|2209288). The counts shown are the
counts detected in each well minus the background counts
(i.e., counts observed in the assay when no polypeptide
was bound to the well of the 96-well assay plate).
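
The scanning layout used in this example, in which the 9-, 12-, 15-, and 18-mers in a row share one starting position, can be sketched in a few lines of Python. This is an illustrative sketch only: the function name `window_set` is hypothetical, and the 31-residue CCR5 amino-terminal sequence is an assumption obtained by joining the overlapping peptides listed in panel A.

```python
def window_set(sequence, lengths=(9, 12, 15, 18)):
    """List, for each start position, the peptides of each window length.

    A window is included only when it fits inside the sequence, so rows
    near the carboxyl end contain only the shorter peptides.
    """
    rows = []
    for start in range(len(sequence) - min(lengths) + 1):
        row = {"start": start + 1}  # 1-based ordinal position
        for n in lengths:
            if start + n <= len(sequence):
                row[n] = sequence[start:start + n]
        rows.append(row)
    return rows


# Assumed CCR5 amino-terminal sequence, joined from the panel A peptides.
ccr5_nterm = "MDYQVSSPIYDINYYTSEPCQKINVKQIAAR"
rows = window_set(ccr5_nterm)
for row in rows[:3]:
    print(row["start"], row[9], row[12], row[15], row[18])
```

Subtracting the no-peptide background from the counts measured for each of these peptides gives the values tabulated in the panels.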
Panel A. CCR5: peptide sequence scanning windows and binding
results for each window length (counts bound - background
(no peptide)). In each sequence row, the 9-, 12-, 15-, and
18-mers share the same initial starting point.

Window length key:
xxxxxxxxx            9
xxxxxxxxxxxx         12
xxxxxxxxxxxxxxx      15
xxxxxxxxxxxxxxxxxx   18

Initial                            Binding results for window length
sequence #   Peptide sequence      9            12           15           18

1            MDYQVSSPIYDINYYTSE    [illegible]  [illegible]  [illegible]  [illegible]
2            DYQVSSPIYDINYYTSEP    1552         [illegible]  5401         [illegible]
3            YQVSSPIYDINYYTSEPC    2533         5305         5415         6119
4            QVSSPIYDINYYTSEPCQ    490          1959         4594         5645
5            VSSPIYDINYYTSEPCQK    509          1629         3280         3521
6            SSPIYDINYYTSEPCQKI    671          [illegible]  [illegible]  [illegible]
7            SPIYDINYYTSEPCQKIN    1503         3463         [illegible]  [illegible]
8            PIYDINYYTSEPCQKINV    1186         2285         2682         2036
9            IYDINYYTSEPCQKINVK    1359         2702         2516         1261
10           YDINYYTSEPCQKINVKQ    4379         5245         3052         1913
11           DINYYTSEPCQKINVKQI    1396         1361         1144         712
12           INYYTSEPCQKINVKQIA    1384         1190         707          684
13           NYYTSEPCQKINVKQIAA    1548         977          760          595
14           YYTSEPCQKINVKQIAAR    1029         1052         847          638
15           YTSEPCQKINVKQIA       567          507          459
16           TSEPCQKINVKQIAA       440          427          509
17           SEPCQKINVKQIAAR       434          430          426
18           EPCQKINVKQIA          397          432
19           PCQKINVKQIAA          386          385
20           CQKINVKQIAAR          435          581
21           QKINVKQIA             453
22           KINVKQIAA             487
Panel B. CXCR4: peptide sequence scanning windows and binding
results for each window length (counts bound - background).
In each sequence row, the 9-, 12-, 15-, and 18-mers share the
same initial starting point.

Window length key:
xxxxxxxxx            9
xxxxxxxxxxxx         12
xxxxxxxxxxxxxxx      15
xxxxxxxxxxxxxxxxxx   18

Initial                            Binding results for window length
sequence #   Peptide sequence      9       12      15      18

1            MEGISIYTSDNYTEEMGS    591     334     3275    2079
2            EGISIYTSDNYTEEMGSG    a       886     7255    1548
3            GISIYTSDNYTEEMGSGD    454     2644    3274    1217
4            ISIYTSDNYTEEMGSGDY    466     3973    2202    861
5            SIYTSDNYTEEMGSGDYD    a       288     168     239
6            IYTSDNYTEEMGSGDYDS    332     335     195     173
7            YTSDNYTEEMGSGDYDSM    181     161     201     103
8            TSDNYTEEMGSGDYDSMK    a       54      119     38
9            SDNYTEEMGSGDYDSMKE    151     149     124     161
10           DNYTEEMGSGDYDSMKEP    67      121     57      102
11           NYTEEMGSGDYDSMKEPC    a       100     30      134
12           YTEEMGSGDYDSMKEPCF    68      213     70      103
13           TEEMGSGDYDSMKEPCFR    146     67      23      47
14           EEMGSGDYDSMKEPCFRE    a       61      121     130
15           EMGSGDYDSMKEPCFREE    64      36      69      64
16           MGSGDYDSMKEPCFREEN    57      68      64      129
17           GSGDYDSMKEPCFREENA    a       155     172     155
18           SGDYDSMKEPCFREENAN    100     118     186     89
19           GDYDSMKEPCFREENANF    53      167     198     134
20           DYDSMKEPCFREENANFN    a       167     146     75
21           YDSMKEPCFREENANFNK    171     144     80      89
22           DSMKEPCFREENANFNKI    85      144     146     40
23           SMKEPCFREENANFN       a       119     55
24           MKEPCFREENANFNK       188     133     74
25           KEPCFREENANFNKI       165     105     93
26           EPCFREENANFN          a       69
27           PCFREENANFNK          104     108
28           CFREENANFNKI          103     66
29           REENANFNK             58
Panel C. STRL33: peptide sequence scanning windows and binding
results for each window length (counts bound - background).
In each sequence row, the 9-, 12-, 15-, and 18-mers share the
same initial starting point.

Window length key:
xxxxxxxxx            9
xxxxxxxxxxxx         12
xxxxxxxxxxxxxxx      15
xxxxxxxxxxxxxxxxxx   18

Initial                            Binding results for window length
sequence #   Peptide sequence      9       12      15      18

1            MAEHDYHEDYGFSSFNDS    160     625     1239    1386
2            AEHDYHEDYGFSSFNDSS    354     697     1095    1014
3            EHDYHEDYGFSSFNDSSQ    509     937     2235    1219
4            HDYHEDYGFSSFNDSSQE    708     1427    1772    1500
5            DYHEDYGFSSFNDSSQEE    851     1554    1240    1191
6            YHEDYGFSSFNDSSQEEH    728     1950    1357    985
7            HEDYGFSSFNDSSQEEHQ    729     1077    947     537
8            EDYGFSSFNDSSQEEHQA    953     817     1152    548
9            DYGFSSFNDSSQEEHQAF    701     573     595     440
10           YGFSSFNDSSQEEHQAFL    345     745     645     1138
11           GFSSFNDSSQEEHQAFLQ    171     480     270     1639
12           FSSFNDSSQEEHQAFLQF    249     403     361     3608
13           SSFNDSSQEEHQAFLQFS    243     277     902     6038
14           SFNDSSQEEHQAFLQFSK    304     303     969     4537
15           FNDSSQEEHQAFLQFSKV    246     470     4089    4678
16           NDSSQEEHQAFLQFS       180     497     6160
17           DSSQEEHQAFLQFSK       147     882     4588
18           SSQEEHQAFLQFSKV       287     4455    4732
19           SQEEHQAFLQFS          647     7512
20           QEEHQAFLQFSK          1109    5672
21           EEHQAFLQFSKV          6060    5598
22           EHQAFLQFS             7505
23           HQAFLQFSK             2761
24           QAFLQFSKV             2600









Example 6

This example shows 125I-HIV-1 LAI gp120 binding to
N-terminal peptide variants of CCR5, CXCR4 and STRL33.
Octadecapeptide alanine replacement variants of
maximum gp120 binding activity peaks were synthesized and
tested for 125I-HIV-1 LAI gp120 binding. Each binding value
presented is the average of two separate synthesis and
binding experiments. Relative percentage of Control =
{[(mean counts/Control counts)] x 100%} ± average
deviation. Background counts (no peptide, see Example 7)
were subtracted from all values. Data for CCR5 are
presented in Panel A; data for CXCR4 are presented in
Panel B; and data for STRL33 are presented in Panel C.
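
The relative percentage of control can be illustrated with a short Python sketch. The numbers below are invented for illustration and are not data from this example; the function name `relative_percent` is hypothetical, and computing the percentage separately for each of the two experiments before averaging is an assumption about how the mean and average deviation were obtained.

```python
def relative_percent(variant_counts, control_counts, background_counts):
    """Return (mean, average deviation) of the relative % of control.

    Each argument holds one raw count per independent experiment.
    Background is subtracted from both variant and control counts
    before the ratio is taken.
    """
    percents = [
        100.0 * (v - bg) / (c - bg)
        for v, c, bg in zip(variant_counts, control_counts, background_counts)
    ]
    mean = sum(percents) / len(percents)
    # Average deviation: mean absolute difference from the mean.
    avg_dev = sum(abs(p - mean) for p in percents) / len(percents)
    return mean, avg_dev


# Hypothetical raw counts from two experiments (illustrative values only).
mean, avg_dev = relative_percent([600, 700], [1100, 1100], [100, 100])
print(f"{mean:.0f} ± {avg_dev:.0f}")
```

With two experiments, the average deviation is simply half the difference between the two percentages.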

Panel A. 125I-HIV-1 LAI gp120 binding to N-terminal peptide
variants of CCR5

Variant    CCR5 variant peptides (1-18)    Relative % of Control*

Control    MDYQVSSPIYDINYYTSE              100
M1A        ADYQVSSPIYDINYYTSE              167 ± 4
D2A        MAYQVSSPIYDINYYTSE              125 ± 8
Y3A        MDAQVSSPIYDINYYTSE              51 ± 2
Q4A        MDYAVSSPIYDINYYTSE              104 ± 7
V5A        MDYQASSPIYDINYYTSE              82 ± 3
S6A        MDYQVASPIYDINYYTSE              124 ± 3
S7A        MDYQVSAPIYDINYYTSE              56 ± 2
P8A        MDYQVSSAIYDINYYTSE              157 ± 2
I9A        MDYQVSSPAYDINYYTSE              24 ± 7
Y10A       MDYQVSSPIADINYYTSE              19 ± 6
D11A       MDYQVSSPIYAINYYTSE              63 ± 22
I12A       MDYQVSSPIYDANYYTSE              14 ± 1
N13A       MDYQVSSPIYDIAYYTSE              253 ± 19
Y14A       MDYQVSSPIYDINAYTSE              15 ± 0.3
Y15A       MDYQVSSPIYDINYATSE              21 ± 5
T16A       MDYQVSSPIYDINYYASE              78 ± 34
S17A       MDYQVSSPIYDINYYTAE              64 ± 6
E18A       MDYQVSSPIYDINYYTSA              4 ± 2

* The percent binding for the wild-type peptide was defined as 100%.
Panel B. 125I-HIV-1 LAI gp120 binding to N-terminal peptide
variants of CXCR4

Variant    CXCR4 variant peptides (1-18)   Relative % of Control*

Control    MEGISIYTSDNYTEEMGS              100
M1A        AEGISIYTSDNYTEEMGS              118 ± 18
E2A        MAGISIYTSDNYTEEMGS              36 ± 0.3
G3A        MEAISIYTSDNYTEEMGS              101 ± 3
I4A        MEGASIYTSDNYTEEMGS              6 ± 0.3
S5A        MEGIAIYTSDNYTEEMGS              133 ± 5
I6A        MEGISAYTSDNYTEEMGS              2 ± 1
Y7A        MEGISIATSDNYTEEMGS              7 ± 0.4
T8A        MEGISIYASDNYTEEMGS              97 ± 10
S9A        MEGISIYTADNYTEEMGS              70 ± 4
D10A       MEGISIYTSANYTEEMGS              71 ± 8
N11A       MEGISIYTSDAYTEEMGS              38 ± 0.4
Y12A       MEGISIYTSDNATEEMGS              28 ± 2
T13A       MEGISIYTSDNYAEEMGS              70 ± 6
E14A       MEGISIYTSDNYTAEMGS              72 ± 1
E15A       MEGISIYTSDNYTEAMGS              56 ± 7
M16A       MEGISIYTSDNYTEEAGS              88 ± 4
G17A       MEGISIYTSDNYTEEMAS              68 ± 8
S18A       MEGISIYTSDNYTEEMGA              79 ± 1

* The percent binding for the wild-type peptide was defined as 100%.
Panel C. 125I-HIV-1 LAI gp120 binding to N-terminal peptide
variants of STRL33

Variant    STRL33 variant peptides (21-38)   Relative % of Control*

Control    EEHQAFLQFSKVFLPCMY                100
E21A       AEHQAFLQFSKVFLPCMY                81 ± [illegible]
E22A       EAHQAFLQFSKVFLPCMY                [illegible]
H23A       EEAQAFLQFSKVFLPCMY                [illegible]
Q24A       EEHAAFLQFSKVFLPCMY                [illegible]
A25A       EEHQAFLQFSKVFLPCMY                101 ± 1
F26A       EEHQAALQFSKVFLPCMY                32 ± 0.1
L27A       EEHQAFAQFSKVFLPCMY                37 ± 2
Q28A       EEHQAFLAFSKVFLPCMY                44 ± 0.4
F29A       EEHQAFLQASKVFLPCMY                20 ± 1
S30A       EEHQAFLQFAKVFLPCMY                92 ± 2
K31A       EEHQAFLQFSAVFLPCMY                162 ± 2
V32A       EEHQAFLQFSKAFLPCMY                51 ± 3
F33A       EEHQAFLQFSKVALPCMY                45 ± 2
L34A       EEHQAFLQFSKVFAPCMY                76 ± 1
P35A       EEHQAFLQFSKVFLACMY                82 ± 3
C36A       EEHQAFLQFSKVFLPAMY                53 ± 5
M37A       EEHQAFLQFSKVFLPCAY                112 ± 4
Y38A       EEHQAFLQFSKVFLPCMA                83 ± 2

* The percent binding for the wild-type peptide was defined as 100%.



Example 7

This example demonstrates that the binding of HIV-1
gp120 envelope protein to the polypeptides of the present
invention, and to the chemokine receptors from which the
present inventive polypeptides were originally derived or
inspired, is conserved across the various strains of
HIV-1. This example also demonstrates that a step
subsequent to initial binding of gp120 to CCR5, CXCR4,
STRL33, and CD4 is the most likely source of the
phenomenon of host-range selectivity. Additionally, this
example demonstrates that the underlying method is
accurate in that receptor variants that are predicted to
have an altered affinity for binding with gp120 do in
fact have a statistically similar alteration in affinity
where comparable changes in the receptors have been
identified in other work and the affinity for binding of
gp120/effect on infectivity has been measured.

This example examines the effect of particular
mutations of CCR5 that were studied in the work
underlying the present invention and that were also
studied by other artisans in the field.

The following table identifies a mutation in the
first column. The first letter designates the wild-type
amino acid present at the position indicated by the
number, and the letter A, which terminates all entries in
the first column, indicates that the amino acid residue
present in that position in the mutant polypeptide is
alaninyl. For example, the first data row (i.e., the
second row of the table) contains the entry Y3A in the
first column, which indicates that the tyrosine residue
at position 3 of the wild-type CCR5 is substituted by an
alanine residue.
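
The mutation notation described above can be made concrete with a short Python sketch. The helper names `apply_substitution` and `alanine_scan` are hypothetical and are introduced only for illustration; the peptide sequences are the CCR5 (1-18) and STRL33 (21-38) wild-type octadecapeptides of Example 6.

```python
import re


def apply_substitution(wild_type, mutation, first_position=1):
    """Apply a substitution such as 'Y3A' to a wild-type peptide.

    first_position is the residue number of the first letter of the
    peptide, so that fragments not starting at residue 1 can be used.
    """
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"unrecognized mutation: {mutation!r}")
    old, position, new = match.group(1), int(match.group(2)), match.group(3)
    index = position - first_position
    if not 0 <= index < len(wild_type):
        raise ValueError(f"position {position} lies outside the peptide")
    if wild_type[index] != old:
        raise ValueError(f"expected {old} at position {position}")
    return wild_type[:index] + new + wild_type[index + 1:]


def alanine_scan(wild_type, first_position=1):
    """Return every single-alanine replacement variant, keyed by name."""
    return {
        f"{residue}{first_position + i}A": wild_type[:i] + "A" + wild_type[i + 1:]
        for i, residue in enumerate(wild_type)
    }


ccr5_wild_type = "MDYQVSSPIYDINYYTSE"
print(apply_substitution(ccr5_wild_type, "Y3A"))
variants = alanine_scan(ccr5_wild_type)
print(len(variants), variants["E18A"])
```

Checking the wild-type residue before substituting guards against an off-by-one error when a fragment does not start at residue 1, as with the STRL33 (21-38) peptide.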

The second column provides the percentage of binding
exhibited by a mutant polypeptide compared to a wild-type
polypeptide, when the methods used to elucidate the
present invention are used in conjunction with
radiolabeled HIV-1 LAI gp120 envelope protein. The third
through seventh columns provide similar data that have
been extracted from the work of others in the field using
a strain of HIV-1 virus indicated at the top of each
column. For example, row 2 of the following table
indicates that when the mutation Y3A is effected in the
human CCR5 chemokine receptor, then the resulting CCR5
polypeptide has 51.4% of the ability to bind HIV-1 LAI
gp120 envelope protein in comparison to an equivalent
wild-type peptide. Similarly, HIV-1 ADA binds to the
mutant polypeptide with 79% of the affinity of a
non-mutated CCR5 chemokine receptor.





Mutation   gp120    YU2    ADA    JR-FL   89.6   DH123

Y3A        51.4     n/a    79     82      n/a    42
Q4A        104      85     132    111     67     105
Y10A       19.2     2      50     26      10     3
D11A       62.8     2      27     22      6      3
Y14A       14.6     12     47     25      6      0
Y15A       21       30     3      3       1      0
E18A       4.1      45     12     12      3      10



Statistical analysis of these data indicates that
the similarity between the binding affinity of each
mutant peptide for gp120 elucidated in this study is not
more than about 25% likely to be causally unrelated to
the effects observed for YU2, and not more than about 4%
likely to be causally unrelated to the effects observed
for each of the other viruses listed in the table above.

Additionally, the affinity measurements generated by
the underlying technique have been demonstrated to be
accurate by (repetitively) showing that antibodies that
specifically bind to radiolabeled gp120 are capable of
preventing the binding of gp120 to polypeptides that have
shown high affinity for binding with gp120 in the
experiments upon which the present invention is
predicated. Thus, this example shows that the binding
of HIV-1 with chemokine receptors can be inhibited by the
present inventive polypeptides, irrespective of the
strain of HIV-1 from which the gp120 protein is obtained.
Example 8

This example provides a characterization of the
critical amino acids in the amino-terminal segments of
CCR5, CXCR4, and STRL33 that are essential for the
ability of these polypeptides to bind with gp120.

In this example, the effect on binding that occurs
due to successive replacement of each amino acid with
alanine is indicated, wherein a (+) signifies a decrease
in binding affinity and a (>) signifies an enhancement in
binding affinity. As is clear from inspection, the
sequences are shown with the amino-terminus at top and
the carboxyl-terminus at bottom.



CCR5 (1-18)    CXCR4 (1-18)    STRL33 (21-38)

M>             M               E
D              E+              E
Y++            G               H
Q              I+++++          Q
V              S>              A
S              I++++++         F+++
S+             Y+++++          L++
P>             T               Q+
I+++           S+              F+++
Y+++           D+              S
D+             N++             K>
I++++          Y++             V+
N>             T               F+
Y++++          E               L
Y+++           E++             P
T              M               C+
S+             G               M
E+++++         S               Y
Example 9

This example employs the same technique as Example 4
and provides information similar to that available from
Example 4.

The data below compare the ability of synthetic
fragments of CD4 to bind to labeled gp120. 9-mers,
12-mers, 15-mers, 18-mers, and 21-mers were selected based
on the data from Example 4. The relative binding
affinities of each group of polypeptides can be
determined by inspection of the number of counts of
radiolabeled gp120 that were retained by each N-mer.
Data supporting these conclusions are provided by
Examples 10 and 11.



Peptide 




gpi20 




Peptide 




Gp120 


starting 


Active Peptides 


bound 




starb'ng 


Active Peptides 


Bound 


position # 




(counts) 




position # 




(counts) 




ACTIVE 9-MERS 








ACTIVE 12-MERS 




105 


DTYICEVED 


1043 




101 


lEDSDTYlCEVE 


1107 


115 


KEEVQLLVF 


1273 




112 


EDQKEEVQLLVF 


1379 


116 


EEVQLLVFG 


3170 




113 


DQKEEVQLLVFG 


1624 


117 


EVQLLVFGL 


2146 




114 


QKEEVQLLVFGL 


1785 










115 


KEEVQLLVPGLT 


1774 










116 


EEVQLLVFGLTA 


3261 










117 


EVQLIiVFGLTAN 


1838 










133 


LLQGQSLTLTLE 


1320 


217 


EQVEFSFPL 


1032 




215 


EGEQVEFSFPLA 


1456 


218 


QVEFSFPLA 


1205 




216 


GEQVEFSFPLAF 


1729 


219 


VEFSFPLAF 


1064 




217 


EQVEFSFPLAFT 


1556 










218 


QVEFSFPLAFTV 


1636 




ACTIVE 15-MERS








ACTIVE 18-MERS




109 


CEVBDQKEEVQLLVF 


1729 




105 


DTYICEVEDQKEE 


1648 












VQLLV 




110 


EVEDQKEEVQLLVFG 


2805 




106 


TYICEYEDQKEEV 


3794 












QLLVF 




111 


VEDQKEEVQLLVPGL 


3816 




107 


YICEVEDQKEEVQ 


4611 
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112 
113 
114 
115 
116 
117 



130 
131 
132 

213 
214 
216 
216 
217 
218 



90 
91 



EDQKEEVQLLVFGLT 
DQKEEVQLLVFGLTA 
QKEEVQLLVFGLTAN 
KEEVQLLVPGLTANS 
EEVQLLVFGLTANSD 
EVQLLVFGLTANSDT 



DTHLLQGQSLTLTLE 
THLLQGQSLTLTLES 
HLLQGQSLTLTLESP 

KKEGEQVEFSFPLAF 
KEGEQVEFSFPIAFT 
EGEQVEFSFPLiAFTV 
GEQVEFSFPLAFTVE 
EQVEFSFPLAFTVEK 
QVEFSFPLAFTVEKL 



ACTIVE 21-MERS 



GNFPLIIKNLKIEDS 
DTYICE 

NFPLI IKNLKIEDSD 
TYICEV 



3633 
3905 
3770 
3485 
6423 
2689 



1622 
1874 
1277 

1921 
3253 
3270 
4656 
4135 
2047 



5248 
7803 





T T XTVr* 




108 


± v^iii V AiJUJxiii Jc> Vy Jj 


0090 




JjVFGIj 




109 


L-r* V ilijy j\J2iE VQIiJj 






VFGLT 




1 lU 


EVEDQKEEVQLLV 


oo47 




FGLTA 




4 4 4 


VEDQKEEVQLLVF 


3913 




GLTAN 




112 


EDQKEBVQLLVFG 


3416 




LTANS 




113 


DQKEEVQLLVFGL 


3317 




TANSD 




114 


QKEEVQLLVFGLT 


3671 


• 


ANSDT 




127 


ANSDTHLLQGQSL 


1540 




TLTLE 




128 


NSDTHLLQGQSLT 


1726 




LTLES 




129 


SDTHLLQGQSLTIi 


1260 




TLESP 




210 


IVYKKEGEQVEFS 


5382 




PPLAF 




211 


VYKKEGEQVEFSP 


4307 




PLAFT 




212 


YKKEGEQVEFSFP 


4839 




LAFTV 




213 


KKEGEQVEFSFPL 


4683 




^ L,li 111 T'lTI 




214 


KEGEQVEFSFPLA 


3117 




FTVEK 




215 


EGEQVEFSFPLAF 


2164 




TVEKL 




216 


GEQVEFSFPLAFT 


1643 




VEKLT 





92 FPLIIKNLKIKDSDT 
YICEVE 

93 PLIIKNLKIEDSDTY 
ICEVED 

94 LIIKNLKIEDSDTYI 
CEVEDQ 

95 IIKNLKIEDSDTYIC 
EVEDQK 

96 IKNLKIEDSDTYICE 
VEDQKE 

97 KNLKIEDSDTYICEV 
EDQKEB 

99 LKIEDSDTYICEVED 
. QKEEVQ 

100 KIEDSDTYICEVEDQ 
KEEVQL 

101 lEDSDTYICEVEDQK 
EEVQLL 

102 EDSDTYICEVEDQKE 
EVQLLV 

103 DSDTYICEVEDQKEE 
VQLLVF 

104 SDTYICEVEDQKEEV 
QLLVFG 

105 DTYICEVEDQKEEVQ 
LLVFGL 

106 TYICEVEDQKEEVQL 
LVFGLT 

107 YICEVEDQKEEVQLL 
- VFGLTA 

108 ICEVEDQKEEVQLLV 
PGLTAN 

109 CEVEDQKEEVQLLVP 
GLTANS 

123 FGLTANSDTHLLQGQ 
SLTLTL 

124 GLTANSDTHLLQGQS 
LTiiTLE 

125 LTANSDTHLLQGQSL 
TLTLES 

126 TANSDTHLLQGQSLT 
LTLESP 
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13919 
20145 
17108 
11892 
15073 
8789 

5519 

6325 
12064 

4933 
30277 
30319 
25424 
20191 
22884 

7276 

3517 

11529 
1406i5 
17113 
23595 
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204 


FOKAS SIVYKKEGEQ 


9382 




V CUl? 0£ JET 




205 


OKAS S I VYKKEGEQV 


24959 




-rtTpQTPPT. 
2ii7 Oi? IT J-t 




206 


K2^ S I VYKKEGEQVE 


30873 








207 


ASS T VYKKEGEOVEP 


25146 




SFPLAF 




208 


SS I VYKKEGEQVEPS 


28068 




FPIiAFT 




. 209 


S I VYKKEGEQVEPS P 


oioo 




PLAFTV 




210 


IVYKKEGEQVEFSPP 






lAFTVE 




221 


FSFPIjAFTV£iJnJj X vstiD 


4163 




GELWWQ 




222 


S FPIiAFTVEKLiTGovj 


2284 








223 


FPLAFTVEKLTGSGE 


6276 




LWWQAE 




224 


PIiAFTVEKLTGSGEIi 


2647 




WWQAER 




225 


LAFTVEKLTGSGELW 


3577 




WQAERA 
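The overlapping N-mers and starting positions tabulated in this
example follow a sliding-window pattern over the CD4 sequence. The
following is a minimal sketch of that pattern, not the method
actually used to prepare the peptides. The function name, the
variable names, and the numbering of the first residue of the
region as position 86 (inferred from the starting positions listed
above) are assumptions made for illustration.

```python
def overlapping_peptides(sequence, length, first_position=1):
    """Return (start_position, peptide) for every window of `length` residues."""
    return [
        (first_position + i, sequence[i:i + length])
        for i in range(len(sequence) - length + 1)
    ]


# CD4 region as it appears in the peptide tables of Example 10.
# Numbering the first residue as 86 is an inference from the
# starting positions listed in this example (an assumption).
CD4_REGION = "LWDQGNFPLIIKNLKIEDSDTYICEVEDQKEEVQLLVFGLTANSDTHLLQGQSLTLTLESP"

nine_mers = dict(overlapping_peptides(CD4_REGION, 9, first_position=86))
twenty_one_mers = overlapping_peptides(CD4_REGION, 21, first_position=86)

print(nine_mers[105])        # DTYICEVED
print(nine_mers[116])        # EEVQLLVFG
print(len(twenty_one_mers))  # 41
```

Each window is shifted by one residue relative to the previous one,
so a region of 61 residues yields 41 overlapping 21-mers.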





Example 10 

This example provides data which enable those
skilled in the art to arrive at the conclusions indicated
in Examples 9 and 12. In this example, the counts of
radiolabeled gp120 retained by each peptide indicated in
the left-hand column are given in the right-hand column.
The first panel (Panel A) provides data for 21-mers of
CD4.



Panel A

PEPTIDE                 COUNTS
LWDQGNFPLIIKNLKIEDSDT   731
WDQGNFPLIIKNLKIEDSDTY   889
DQGNFPLIIKNLKIEDSDTYI   1138
QGNFPLIIKNLKIEDSDTYIC   2242
GNFPLIIKNLKIEDSDTYICE   5248
NFPLIIKNLKIEDSDTYICEV   7803
FPLIIKNLKIEDSDTYICEVE   13919
PLIIKNLKIEDSDTYICEVED   20145
LIIKNLKIEDSDTYICEVEDQ   17108
IIKNLKIEDSDTYICEVEDQK   11892
IKNLKIEDSDTYICEVEDQKE   15073
KNLKIEDSDTYICEVEDQKEE   8789
NLKIEDSDTYICEVEDQKEEV   2016
LKIEDSDTYICEVEDQKEEVQ   5519
KIEDSDTYICEVEDQKEEVQL   6325
IEDSDTYICEVEDQKEEVQLL   12064
EDSDTYICEVEDQKEEVQLLV   4933
DSDTYICEVEDQKEEVQLLVF   30277
SDTYICEVEDQKEEVQLLVFG   30319
DTYICEVEDQKEEVQLLVFGL   25424
TYICEVEDQKEEVQLLVFGLT   20191
YICEVEDQKEEVQLLVFGLTA   22884
ICEVEDQKEEVQLLVFGLTAN   7276
CEVEDQKEEVQLLVFGLTANS   3517
EVEDQKEEVQLLVFGLTANSD   1687
VEDQKEEVQLLVFGLTANSDT   646
EDQKEEVQLLVFGLTANSDTH   562
DQKEEVQLLVFGLTANSDTHL   599
QKEEVQLLVFGLTANSDTHLL   573
KEEVQLLVFGLTANSDTHLLQ   682
EEVQLLVFGLTANSDTHLLQG   690
EVQLLVFGLTANSDTHLLQGQ   589
VQLLVFGLTANSDTHLLQGQS   1099
QLLVFGLTANSDTHLLQGQSL   2057
LLVFGLTANSDTHLLQGQSLT   860
LVFGLTANSDTHLLQGQSLTL   4677
VFGLTANSDTHLLQGQSLTLT   2762
FGLTANSDTHLLQGQSLTLTL   11529
GLTANSDTHLLQGQSLTLTLE   14065
LTANSDTHLLQGQSLTLTLES   17113
TANSDTHLLQGQSLTLTLESP   23595
Empty (control)         515
TWTCTVLQNQKKVEFKIDIVV   1430
WTCTVLQNQKKVEFKIDIVVL   1616
TCTVLQNQKKVEFKIDIVVLA   1092
CTVLQNQKKVEFKIDIVVLAF   2909
TVLQNQKKVEFKIDIVVLAFQ   3273
VLQNQKKVEFKIDIVVLAFQK   1323
LQNQKKVEFKIDIVVLAFQKA   1256
QNQKKVEFKIDIVVLAFQKAS   1808
NQKKVEFKIDIVVLAFQKASS   1507
QKKVEFKIDIVVLAFQKASSI   759
KKVEFKIDIVVLAFQKASSIV   782
KVEFKIDIVVLAFQKASSIVY   635
VEFKIDIVVLAFQKASSIVYK   725
EFKIDIVVLAFQKASSIVYKK   649
FKIDIVVLAFQKASSIVYKKE   593
KIDIVVLAFQKASSIVYKKEG   1394
IDIVVLAFQKASSIVYKKEGE   962
DIVVLAFQKASSIVYKKEGEQ   788
IVVLAFQKASSIVYKKEGEQV   646
VVLAFQKASSIVYKKEGEQVE   772
VLAFQKASSIVYKKEGEQVEF   1793
LAFQKASSIVYKKEGEQVEFS   1410
AFQKASSIVYKKEGEQVEFSF   3775
FQKASSIVYKKEGEQVEFSFP   9382
QKASSIVYKKEGEQVEFSFPL   24959
KASSIVYKKEGEQVEFSFPLA   30873
ASSIVYKKEGEQVEFSFPLAF   25146
SSIVYKKEGEQVEFSFPLAFT   28068
SIVYKKEGEQVEFSFPLAFTV   8165
IVYKKEGEQVEFSFPLAFTVE   15620
VYKKEGEQVEFSFPLAFTVEK   2429
YKKEGEQVEFSFPLAFTVEKL   735
KKEGEQVEFSFPLAFTVEKLT   1847
KEGEQVEFSFPLAFTVEKLTG   972
EGEQVEFSFPLAFTVEKLTGS   739
GEQVEFSFPLAFTVEKLTGSG   652
EQVEFSFPLAFTVEKLTGSGE   765
QVEFSFPLAFTVEKLTGSGEL   741
VEFSFPLAFTVEKLTGSGELW   633
EFSFPLAFTVEKLTGSGELWW   681
FSFPLAFTVEKLTGSGELWWQ   4163
SFPLAFTVEKLTGSGELWWQA   2284
FPLAFTVEKLTGSGELWWQAE   6276
PLAFTVEKLTGSGELWWQAER   2647
LAFTVEKLTGSGELWWQAERA   3577
AFTVEKLTGSGELWWQAERAS   1739
Empty (control)         617
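The counts retained by each peptide can be compared with the
empty control of the same panel. The sketch below expresses a few
Panel A counts as multiples of the first empty control (515). It
is an illustration only: the function name, the variable names,
and in particular the two-fold cutoff are assumptions and are not
stated in this specification.

```python
def fold_over_control(counts_by_peptide, control_counts):
    """Express each peptide's retained counts as a multiple of the empty control."""
    return {
        peptide: counts / control_counts
        for peptide, counts in counts_by_peptide.items()
    }


# A few 21-mers and their counts as listed in Panel A.
panel_a_subset = {
    "DSDTYICEVEDQKEEVQLLVF": 30277,
    "SDTYICEVEDQKEEVQLLVFG": 30319,
    "EDQKEEVQLLVFGLTANSDTH": 562,
    "DQKEEVQLLVFGLTANSDTHL": 599,
}
EMPTY_CONTROL = 515  # first "Empty (control)" row of Panel A
CUTOFF = 2.0         # hypothetical threshold, not taken from the source

ratios = fold_over_control(panel_a_subset, EMPTY_CONTROL)
active = sorted(p for p, ratio in ratios.items() if ratio >= CUTOFF)
print(active)
```

With these inputs, the two peptides with counts near 30,000 exceed
the assumed cutoff, while the two with counts near 600 do not.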




The second and third panels (Panels B and C) provide data for
15-mers and 18-mers from a small region of CD4.



Panel B

PEPTIDE                 COUNTS
LWDQGNFPLIIKNLK         502
WDQGNFPLIIKNLKI         534
DQGNFPLIIKNLKIE         635
QGNFPLIIKNLKIED         509
GNFPLIIKNLKIEDS         624
NFPLIIKNLKIEDSD         654
FPLIIKNLKIEDSDT         539
PLIIKNLKIEDSDTY         661
LIIKNLKIEDSDTYI         542
IIKNLKIEDSDTYIC         664
IKNLKIEDSDTYICE         568
KNLKIEDSDTYICEV         562
NLKIEDSDTYICEVE         1160
LKIEDSDTYICEVED         846
KIEDSDTYICEVEDQ         1088
IEDSDTYICEVEDQK         1143
EDSDTYICEVEDQKE         815
DSDTYICEVEDQKEE         973
SDTYICEVEDQKEEV         993
DTYICEVEDQKEEVQ         1071
TYICEVEDQKEEVQL         956
YICEVEDQKEEVQLL         1064
ICEVEDQKEEVQLLV         1084
CEVEDQKEEVQLLVF         1729
EVEDQKEEVQLLVFG         2805
VEDQKEEVQLLVFGL         3816
EDQKEEVQLLVFGLT         3633
DQKEEVQLLVFGLTA         3905
QKEEVQLLVFGLTAN         3770
KEEVQLLVFGLTANS         3485
EEVQLLVFGLTANSD         6423
EVQLLVFGLTANSDT         2689
VQLLVFGLTANSDTH         1006
QLLVFGLTANSDTHL         865
LLVFGLTANSDTHLL         599
LVFGLTANSDTHLLQ         609
VFGLTANSDTHLLQG         532
FGLTANSDTHLLQGQ         625
GLTANSDTHLLQGQS         [illegible]
LTANSDTHLLQGQSL         [illegible]
TANSDTHLLQGQSLT         [illegible]
ANSDTHLLQGQSLTL         542
NSDTHLLQGQSLTLT         631
SDTHLLQGQSLTLTL         747
DTHLLQGQSLTLTLE         1622
THLLQGQSLTLTLES         1874
HLLQGQSLTLTLESP         1277

LWDQGNFPLIIKNLKIED      582
WDQGNFPLIIKNLKIEDS      626
DQGNFPLIIKNLKIEDSD      598
QGNFPLIIKNLKIEDSDT      564
GNFPLIIKNLKIEDSDTY      567
NFPLIIKNLKIEDSDTYI      627
FPLIIKNLKIEDSDTYIC      509
PLIIKNLKIEDSDTYICE      624
LIIKNLKIEDSDTYICEV      634
IIKNLKIEDSDTYICEVE      751
IKNLKIEDSDTYICEVED      699
KNLKIEDSDTYICEVEDQ      708
NLKIEDSDTYICEVEDQK      863
LKIEDSDTYICEVEDQKE      872
KIEDSDTYICEVEDQKEE      858
IEDSDTYICEVEDQKEEV      1230
EDSDTYICEVEDQKEEVQ      788
DSDTYICEVEDQKEEVQL      961
SDTYICEVEDQKEEVQLL      870
DTYICEVEDQKEEVQLLV      1648
TYICEVEDQKEEVQLLVF      3794
YICEVEDQKEEVQLLVFG      4611
ICEVEDQKEEVQLLVFGL      3898
CEVEDQKEEVQLLVFGLT      3797
EVEDQKEEVQLLVFGLTA      3647
VEDQKEEVQLLVFGLTAN      3913
EDQKEEVQLLVFGLTANS      3416
DQKEEVQLLVFGLTANSD      3317
QKEEVQLLVFGLTANSDT      3671
KEEVQLLVFGLTANSDTH      1271
EEVQLLVFGLTANSDTHL      783
EVQLLVFGLTANSDTHLL      667
VQLLVFGLTANSDTHLLQ      673
QLLVFGLTANSDTHLLQG      574
LLVFGLTANSDTHLLQGQ      568
LVFGLTANSDTHLLQGQS      564
VFGLTANSDTHLLQGQSL      531
FGLTANSDTHLLQGQSLT      591
GLTANSDTHLLQGQSLTL      572
LTANSDTHLLQGQSLTLT      528
TANSDTHLLQGQSLTLTL      891
ANSDTHLLQGQSLTLTLE      1540
NSDTHLLQGQSLTLTLES      1726
SDTHLLQGQSLTLTLESP      1260
Empty (control)         575

Panel C

PEPTIDE                 COUNTS
WTCTVLQNQKKVEFK         566
TCTVLQNQKKVEFKI         510
CTVLQNQKKVEFKID         608
TVLQNQKKVEFKIDI         587
VLQNQKKVEFKIDIV         605
LQNQKKVEFKIDIVV         644
QNQKKVEFKIDIVVL         636
NQKKVEFKIDIVVLA         860
QKKVEFKIDIVVLAF         1333
KKVEFKIDIVVLAFQ         951
KVEFKIDIVVLAFQK         1051
VEFKIDIVVLAFQKA         1005
EFKIDIVVLAFQKAS         1188
FKIDIVVLAFQKASS         1001
KIDIVVLAFQKASSI         956
IDIVVLAFQKASSIV         865
DIVVLAFQKASSIVY         776
IVVLAFQKASSIVYK         783
VVLAFQKASSIVYKK         577
VLAFQKASSIVYKKE         634
LAFQKASSIVYKKEG         593
AFQKASSIVYKKEGE         544
FQKASSIVYKKEGEQ         637
QKASSIVYKKEGEQV         519
KASSIVYKKEGEQVE         563
ASSIVYKKEGEQVEF         589
SSIVYKKEGEQVEFS         558
SIVYKKEGEQVEFSF         651
IVYKKEGEQVEFSFP         615
VYKKEGEQVEFSFPL         714
YKKEGEQVEFSFPLA         687
KKEGEQVEFSFPLAF         1921
KEGEQVEFSFPLAFT         3253
EGEQVEFSFPLAFTV         3270
GEQVEFSFPLAFTVE         4656
EQVEFSFPLAFTVEK         4135
QVEFSFPLAFTVEKL         2047
VEFSFPLAFTVEKLT         899
EFSFPLAFTVEKLTG         920
FSFPLAFTVEKLTGS         672
SFPLAFTVEKLTGSG         565
FPLAFTVEKLTGSGE         556
PLAFTVEKLTGSGEL         612
LAFTVEKLTGSGELW         579
AFTVEKLTGSGELWW         586
FTVEKLTGSGELWWQ         625
TVEKLTGSGELWWQA         550
VEKLTGSGELWWQAE         735
EKLTGSGELWWQAER         683

WTCTVLQNQKKVEFKIDI      588
TCTVLQNQKKVEFKIDIV      571
CTVLQNQKKVEFKIDIVV      553
TVLQNQKKVEFKIDIVVL      655
VLQNQKKVEFKIDIVVLA      724
LQNQKKVEFKIDIVVLAF      938
QNQKKVEFKIDIVVLAFQ      917
NQKKVEFKIDIVVLAFQK      889
QKKVEFKIDIVVLAFQKA      1013
KKVEFKIDIVVLAFQKAS      912
KVEFKIDIVVLAFQKASS      1011
VEFKIDIVVLAFQKASSI      819
EFKIDIVVLAFQKASSIV      799
FKIDIVVLAFQKASSIVY      843
KIDIVVLAFQKASSIVYK      779
IDIVVLAFQKASSIVYKK      711
DIVVLAFQKASSIVYKKE      660
IVVLAFQKASSIVYKKEG      531
VVLAFQKASSIVYKKEGE      560
VLAFQKASSIVYKKEGEQ      549
LAFQKASSIVYKKEGEQV      665
AFQKASSIVYKKEGEQVE      514
FQKASSIVYKKEGEQVEF      528
QKASSIVYKKEGEQVEFS      602
KASSIVYKKEGEQVEFSF      536
ASSIVYKKEGEQVEFSFP      701
SSIVYKKEGEQVEFSFPL      756
SIVYKKEGEQVEFSFPLA      771
IVYKKEGEQVEFSFPLAF      5382
VYKKEGEQVEFSFPLAFT      4307
YKKEGEQVEFSFPLAFTV      4839
KKEGEQVEFSFPLAFTVE      4683
KEGEQVEFSFPLAFTVEK      3117
EGEQVEFSFPLAFTVEKL      2164
GEQVEFSFPLAFTVEKLT      1643
EQVEFSFPLAFTVEKLTG      798
QVEFSFPLAFTVEKLTGS      736
VEFSFPLAFTVEKLTGSG      533
EFSFPLAFTVEKLTGSGE      668
FSFPLAFTVEKLTGSGEL      613
SFPLAFTVEKLTGSGELW      656
FPLAFTVEKLTGSGELWW      586
PLAFTVEKLTGSGELWWQ      650
LAFTVEKLTGSGELWWQA      866
AFTVEKLTGSGELWWQAE      788
FTVEKLTGSGELWWQAER      1143
Empty (control)         556



The fourth and fifth panels (Panels D and E) provide data for
select 9-mers and 12-mers of CD4.



Panel D

PEPTIDE                 COUNTS
DQGNFPLII               662
QGNFPLIIK               508
GNFPLIIKN               600
NFPLIIKNL               561
FPLIIKNLK               601
PLIIKNLKI               697
LIIKNLKIE               515
IIKNLKIED               658
IKNLKIEDS               557
KNLKIEDSD               612
NLKIEDSDT               512
LKIEDSDTY               492
KIEDSDTYI               603
IEDSDTYIC               567
EDSDTYICE               650
DSDTYICEV               712
SDTYICEVE               819
DTYICEVED               1043
TYICEVEDQ               805
YICEVEDQK               728
ICEVEDQKE               596
CEVEDQKEE               555
EVEDQKEEV               587
VEDQKEEVQ               521
EDQKEEVQL               [illegible]
DQKEEVQLL               [illegible]
QKEEVQLLV               [illegible]
KEEVQLLVF               1273
EEVQLLVFG               3170
EVQLLVFGL               2146
VQLLVFGLT               815
QLLVFGLTA               822
LLVFGLTAN               676
LVFGLTANS               522
VFGLTANSD               549
FGLTANSDT               563
GLTANSDTH               481
LTANSDTHL               596
TANSDTHLL               554
ANSDTHLLQ               642
NSDTHLLQG               561
SDTHLLQGQ               526
DTHLLQGQS               [illegible]
THLLQGQSL               [illegible]
HLLQGQSLT               [illegible]
LLQGQSLTL               [illegible]
LQGQSLTLT               [illegible]
QGQSLTLTL               [illegible]
GQSLTLTLE               [illegible]

DQGNFPLIIKNL            [illegible]
QGNFPLIIKNLK            [illegible]
[remaining 12-mer rows illegible in source]
Empty (control)         571

Panels F and G provide data on sequential alanine
replacements for selected CD4 polypeptides.
ZZZZZZZSLTLTLES         1086
ZZZZZZZZLTLTLES         1201
THLLQGQSLTLTLES         9756
GEQVEFSFPLAFTVE         18856
GEQVEFSFPLAFTVZ         16222
GEQVEFSFPLAFTZZ         12535
GEQVEFSFPLAFZZZ         11384
GEQVEFSFPZZZZZZ         2208
ZZQVEFSFPLAFTVE         18684
ZZZZZFSFPLAFTVE         2134
ZZZZZZSFPLAFTVE         1454
ZZZZZZZFPLAFTVE         1391
ZZZZZZZZPLAFTVE         1489
GEQVEFSFPLAFTVE         18867
Empty (control)         580
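The rows above show a parent peptide with a growing run of Z
symbols at one end. The sketch below reproduces that pattern,
assuming that Z marks a position where the native residue has been
replaced, as the panel description suggests. The function names
and the `symbol` parameter are hypothetical and serve only to
illustrate how such a series could be enumerated.

```python
def replace_from_n_terminus(peptide, n, symbol="Z"):
    """Replace the first n residues of `peptide` with `symbol`."""
    return symbol * n + peptide[n:]


def replace_from_c_terminus(peptide, n, symbol="Z"):
    """Replace the last n residues of `peptide` with `symbol`."""
    return peptide[:len(peptide) - n] + symbol * n


parent = "GEQVEFSFPLAFTVE"  # 15-mer listed in Panels F and G

print(replace_from_n_terminus(parent, 2))  # ZZQVEFSFPLAFTVE
print(replace_from_c_terminus(parent, 3))  # GEQVEFSFPLAFZZZ

# Full series of progressive replacements from the carboxyl end.
c_terminal_series = [
    replace_from_c_terminus(parent, n) for n in range(len(parent) + 1)
]
```

The peptide length is unchanged by replacement, which distinguishes
this series from the serial deletions described for Panel B of
Example 11.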



Example 11 

This example characterizes CD4 receptor sequences found
to have HIV gp120 binding activity in screening tests.
Panel A displays information obtained from sequential
replacement of amino acid residues by alaninyl residues.
In Panel A, a (+) signifies a decrease in binding
affinity, whereas a (>) indicates that replacement of the
residue by an alaninyl residue yields an increase in
binding affinity. Sequences are shown with the
amino-terminus at the top and the carboxyl-terminus at the
bottom. Right and left sides are from independent
assays.
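Entries of the form used in such panels (a one-letter residue code
followed by zero or more (+) or (>) symbols, for example I++++++ or
S>) can be read mechanically. The following is a minimal sketch
under stated assumptions: that the number of symbols reflects the
size of the effect, and that an unmarked residue means no
detectable change. Neither assumption is stated in this
specification, and the function name is hypothetical.

```python
import re

ENTRY_PATTERN = re.compile(r"([A-Z])(\++|>+)?")


def parse_scan_entry(entry):
    """Split an entry such as 'I++++++' into (residue, effect, marker_count)."""
    match = ENTRY_PATTERN.fullmatch(entry.strip())
    if match is None:
        raise ValueError(f"unrecognized entry: {entry!r}")
    residue = match.group(1)
    markers = match.group(2) or ""
    if not markers:
        return residue, "no change", 0  # assumed meaning of an unmarked residue
    effect = "decrease" if markers[0] == "+" else "increase"
    return residue, effect, len(markers)


print(parse_scan_entry("I++++++"))  # ('I', 'decrease', 6)
print(parse_scan_entry("S>"))       # ('S', 'increase', 1)
print(parse_scan_entry("A"))        # ('A', 'no change', 0)
```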
Panel B indicates the effect on binding affinity when
successive amino acid residues are deleted, either from
the amino-terminus (right-side symbols) or from the
carboxyl-terminus at the bottom (left-side symbols). A (+)
signifies a decrease in binding affinity, and the
underlined residues indicate which residue was the last
residue to be serially deleted.
All publications cited herein are hereby
incorporated by reference to the same extent as if each
publication were individually and specifically indicated
to be incorporated by reference and were set forth in its
entirety herein.

While this invention has been described with an 
emphasis upon preferred embodiments, it will be obvious 
to those of ordinary skill in the art that variations of 
the preferred embodiments can be used and that it is 
intended that the invention can be practiced otherwise 
than as specifically described herein. Accordingly, this 
invention includes all modifications encompassed within
the spirit and scope of the invention as defined by the 
following claims. 



